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Background of the Invention 

5 Pattern formation is the activity by which embryonic cells form ordered spatial 

arrangements of differentiated tissues. The physical complexity of higher organisms arises 
during embryogenesis through the interplay of cell-intrinsic lineage and cell-extrinsic 
signaling. Inductive interactions are essential to embryonic patterning in vertebrate 
development from the earliest establishment of the body plan, to the patterning of the organ 

) systems, to the generation of diverse cell types during tissue differentiation (Davidson, E., 
(1990) Development 108: 365-389; Gurdon, J. B., (1992) Cell 68: 185-199; Jessell, T. M. et 
al, (1992) Cell 68: 257-270). The effects of developmental cell interactions are varied. 
Typically, responding cells are diverted from one route of cell differentiation to another by 
inducing cells that differ from both the uninduced and induced states of the responding cells 

' (inductions). Sometimes cells induce their neighbors to differentiate like themselves 
(homoiogenetic induction); in other cases a cell inhibits its neighbors from differentiating 
like itself. Cell interactions in early development may be sequential, such that an initial 
induction between two cell types leads to a progressive amplification of diversity. Moreover, 
inductive interactions occur not only in embryos, but in adult cells as well, and can act to 

' establish and maintain morphogenetic patterns as well as induce differentiation (J.B. Gurdon 
(1992) Cell 68:185-199). 

The origin of the nervous system in all vertebrates can be traced to the end of 
gastrulation. At this time, the ectoderm in the dorsal side of the embryo changes its fate from 
epidermal to neural. The newly formed neuroectoderm thickens to form a flattened structure 
called the neural plate which is characterized, in some vertebrates, by a central groove (neural 
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groove) and thickened lateral edges (neural folds). At its early stages of differentiation, the 
neural plate already exhibits signs of regional differentiation along its anterior posterior (A-P) 
and mediolateral axis (M-L). The neural folds eventually fuse at the dorsal midline to form 
the neural tube which will differentiate into brain at its anterior end and spinal, cord at its 
5 posterior end. Closure of the neural tube creates dorsal/ventral differences by virtue of 
previous mediolateral differentiation. Thus, at the end of neurulation, the neural tube has a 
clear anterior-posterior (A-P), dorsal ventral (D-V) and mediolateral (M-L) polarities (see, for 
example, Principles in Neural Science (3rd), eds. Kandel, Schwartz and Jessell, Elsevier 
Science Publishing Company: NY, 1991; and Developmental Biology (3rd), ed. S.F. Gilbert, 
10 Sinauer Associates: Sunderland MA, 1991). Inductive interactions that define the fate of 
cells within the neural tube establish die initial pattern of the embryonic vertebrate nervous 
system. In the spinal cord, the identify of cell types is controlled, in part, by signals from two 
midline cell groups, the notochord and floor plate, that induce neural plate cells to 
differentiate into floor plate, motor neurons, and other ventral neuronal types (van Straaten et 
g 15 al. (1988) Anat Embryol 177:317-324; Placzek et al. (1993) Development 117:205-218; 
J Yamada et al. (1991) Cell 64:035-647; and Hatta et al. (1991) Nature 350:339-341). In 

*2 addition, signals from the floor plate are responsible for the orientation and direction of 

J commissural neuron outgrowth (Placzek, M. et al., (1990) Development 1 10: 19-30). Besides 

fp patterning the neural tube, the notochord and floorplate are also responsible for producing 

^ 20 signals which control the patterning of the somites by inhibiting differentiation of dorsal 
P somite derivatives in the ventral regions (Brand-Saberi, B. et al., (1993) Anat Embryol 188: 

^ 239-245; Porquie, O. et al., (1993) Proa Natl. Acad Set USA 90: 5242-5246). 

P 

W Another important signaling center exists in the posterior mesenchyme of developing 

limb buds, called the Zone of Polarizing Activity, or "ZPA rt . When tissue from the posterior 
25 region of the limb bud is grafted to the anterior border of a second limb bud, the resultant 
limb will develop with additional digits in a mirror-image sequence along the anteroposterior 
axis (Saunders and Gasseling, (1968) Epithelial-Mesenchymal Interaction, pp. 78-97). This 
finding has led to the model that the ZPA is responsible for normal anteroposterior patterning 
in the limb. The ZPA has been hypothesized to function by releasing a signal, termed a 
30 "morphogen", which forms a gradient across the early embryonic bud. According to this 
model, the fate of cells at different distances from the ZPA is determined by the local 
concentration of the morphogen, with specific thresholds of the morphogen inducing 
successive structures (Wolpert, (1969) Theor. Biol 25:1-47). This is supported by the 
finding that the extent of digit duplication is proportional to the number of implanted ZPA 
35 cells (Tickle, (1981) Nature 254:199-202). 

A candidate for the putative ZPA morphogen was identified by the discovery that a 
source of retinoic acid can result in the same type of mirror-image digit duplications when 
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placed in the anterior of a limb bud (Tickle et al., (1982) Nature 296:564-565; Summerbell, 
(1983) J. Embryol 78:269-289). The response to exogenous retinoic acid is concentration 
dependent as the morphogen model demands (Tickle et al., (1985) Dev. Biol. 109:82-95). 
Moreover, a differential distribution of retinoic acid exists across the limb bud, with a higher 
concentration in the ZPA region (Thaller and Eichele, (1987) Nature 327:625-628). 

Recent evidence, however, has indicated that retinoic acid is unlikely to be the 
endogenous factor responsible for ZPA activity (reviewed in Brockes, (1991) Nature 350:15; 
Tabin, (1991) Cell 66:199-217). It is now believed that rather than directly mimicking an 
endogenous signal, retinoic acid implants act by inducing an ectopic ZPA. The anterior limb 
tissue just distal to a retinoic acid implant and directly under the ectoderm has been 
demonstrated to acquire ZPA activity by serially transplanting that tissue to another limb bud 
(Summerbell and Harvey, (1983) Limb Development and Regeneration pp. 109-118; Wanek 
et al., (1991) Nature 350:81-83). Conversely, the tissue next to a ZPA graft does not gain 
p ZPA activity (Smith, (1979) J. Embryol 52:105-1 13). Exogenous retinoic acid would thus 

W 15 appear to act upstream of the ZPA in limb patterning. 

Nj The immediate downstream targets of ZPA action are not known. However, one 

jp important set of genes which are ectopically activated during ZPA-induced pattern 

jf duplications are the 5' genes of the Hoxd cluster. These genes are normally expressed in a 

nested pattern emanating from the posterior margin of the limb bud (Dolle et al., (1989) 
C 20 Nature 342:767-772; Izpisua-Belmonte et al., (1991) Nature 350:585-589). This nested 
^ pattern of Hox gene expression has been directly demonstrated to determine the identity of 

B the structures produced along the anteroposterior axis of the limb (Morgan et al., (1993) 

& Nature 358:236-239). As this would predict, ZPA grafts which produce mirror-image 

®" duplication of structures at an anatomical level first lead to the ectopic activation of the Hoxd 

25 genes in a mirror-image duplication at the molecular level. (Nohno et al., (1991) Cell 
64:1 197-1205; Izpisua-Belmonte et al., (1991) Nature 350:585-589). The molecular signals 
which regulate the expression of these important genes are currently not understood. 



Summary of the Invention 

30 The present invention relates to the discovery of a novel family of genes, and gene 

products, expressed in vertebrate organisms, which genes referred to hereinafter as the 
"hedgehog" gene family, the products of which are referred to as hedgehog proteins. The 
products of the hedgehog gene have apparent broad involvement in the formation and 
maintenance of ordered spatial arrangements of differentiated tissues in vertebrates, both 

35 adult and embryonic, and can be used to generate and/or maintain an array of different 
vertebrate tissue both in vitro and in vivo. 
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In general, the invention features hedgehog polypeptides, preferably substantially 
pure preparations of one or more of the subject hedgehog polypeptides. The invention also 
provides recombinantly produced hedgehog polypeptides. In preferred embodiments the 
polypeptide has a biological activity including: an ability to modulate proliferation, survival 
' 5 and/or differentiation of mesodermally-derived tissue, such as tissue derived from dorsal 
mesoderm; the ability to modulate proliferation, survival and/or differentiation of 
ectodermally-derived tissue, such as tissue derived from the neural tube, neural crest, or head 
mesenchyme; the ability to modulate proliferation, survival and/or differentiation of 
endodermaUy-derived tissue, such as tissue derived from the primitive gut Moreover, in 
1 0 preferred embodiments, the subject hedgehog proteins have the ability to induce expression 
of .secondary signaling molecules, such as members of the Transforming Growth Factor P 
family, as well as members of the fibroblast growth factor (FGF) family. 

In a preferred embodiment, the polypeptide is identical with or homologous to a Sonic 
w hedgehog (Shh) polypeptide, such as a mammalian Shh represented by SEQ ID Nos: 1 3 or 1 1 , 

3 15 an avian Shh represented by SEQ ID No: 8, or a fish Shh represented by SEQ ID No: 12. For 
2 instance, the Shh polypeptide preferably has an amino acid sequence at least 60% 

i homologous to a polypeptide represented by any of SEQ ID Nos: 8, 1 1, 12 or 13, though 

| polypeptides with higher sequence homologies of, for example, 80%, 90% or 95% are also 

tfj contemplated. Exemplary Shh proteins are represented by SEQ ID No. 40. The Shh 

* 20 polypeptide can comprise a full length protein, such as represented in the sequence listings, 
§ or it can comprise a fragment of, for instance, at least 5, 10, 20, 50, 100, 150 or 200 amino 

g acids in length. Preferred hedgehog polypeptides include Shh sequences corresponding 

approximately to the natural proteolytic fragments of the hedgehog proteins, such as from 
about Cys-24 through about the region that contains the proteolytic processing site, e.g., Ala- 
25 194 to Gly-203, or from about Cys-198 through Ala-475 of the human Shh protein, or 
analogous fragments thereto. 

In another preferred embodiment, the polypeptide is identical with or homologous to 
an Indian hedgehog {Ihh) polypeptide, such as a human Ihh represented by SEQ ID No: 14, or 
a mouse Ihh represented by SEQ ID No: 10. For instance, the Ihh polypeptide preferably has 
30 an amino acid sequence at least 60% homologous to a polypeptide represented by either of 
SEQ ID Nos: 10 or 14, though Ihh polypeptides with higher sequence homologies of, for 
example, 80%, 90% or 95% are also contemplated. The polypeptide can comprise the full 
length protein represented by in part by these sequences, or it can comprise a fragment of, for 
instance, at least 5, 10, 20, 50, 100, 150 or 200 amino acids in length. Preferred Ihh 
35 polypeptides comprise an N-terminal fragment from Cys-28 through the region that contains 
the proteolytic processing site, e.g., Ala-198 to Gly-207, or a C-terminal fragment from about 
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Cys-203 through Ser-411 of the mouse Ihh represented by SEQ ID No:10, or analogous 
fragments thereto. 

In still a further preferred embodiment, the polypeptide is identical with or 
homologous to a Desert hedgehog (Dhh) polypeptide, such as a mouse Dhh represented by 
5 SEQ ID No: 9. For instance, the Dhh polypeptide preferably has an amino acid sequence at 
least 60% homologous to a polypeptide represented by SEQ ID No: 9, though Dhh 
polypeptides with higher sequence homologies of, for example, 80%, 90% or 95% are also 
contemplated. The polypeptide can comprise the full length protein represented by this 
sequence, or it can comprise a fragment of, for instance, at least 5, 10, 20, 50, 100, 150 or 
10 200 amino acids in length. Preferred Dhh polypeptides comprise Dhh sequences 
corresponding to the N-terminal portion of the protein from about Cys-23 through about the 
region that contains the proteolytic processing site, e.g., Val-124 to Asn-203 or C-terminal 
fragment from about Cys-199 through Gly-396 of SEQ ID No:9, or analogous fragments 
thereto. 

J 15 in another preferred embodiment, the invention features a purified or recombinant 

H polypeptide fragment of a hedgehog protein, which polypeptide has the ability to modulate, 

jjl e.g., mimic or antagonize, a the activity of a wild-type hedgehog protein. Preferably, the 

P polypeptide fragment comprises a sequence identical or homologous to an amino acid 

f sequence designated in one of SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:l 1, 

b 20 SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More preferably, the polypeptide 
j|i fragment comprises an amino acid sequence designated in SEQ ID No: 40, e.g., includes the 

fl£ fragment of Cys-1 to Gly-174. 

g>, in y e t another preferred embodiment, the invention features a purified or recombinant 

polypeptide, which polypeptide has a molecular weight of approximately 19kDa and has the 
25 ability to modulate, e.g., mimic or antagonize, a the activity of a wild-type hedgehog protein. 
Preferably, the polypeptide comprises an amino acid sequence identical or homologous to an 
sequence designated in one of SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:l 1, 
SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More preferably, the polypeptide 
comprises an amino acid sequence designated in SEQ ID No:40. 

30 In still another preferred embodiment, the invention features a purified or 

recombinant hedgehog polypeptide comprising an amino acid sequence represented by the 
formula A-B wherein, A represents all or the portion of the amino acid sequence designated 
by residues 1-168 of SEQ ID No:40; and B represents at least one amino acid residue of the 
amino acid sequence designated by residues 169-221 of SEQ ID No:40; wherein A and B 

35 together represent a contiguous polypeptide sequence represented by SEQ ID No:40, and the 
polypeptide modulates, e.g., mimics or antagonizes, the biological activity of a hedgehog 
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protein. Preferably, B can represent at least 5, 10 or 20 amino acid residues of the amino acid 
sequence designated by residues 169-221 of SEQ ID No:40. 

In another embodiment, the invention features a purified or recombinant polypeptide 
comprising an amino acid sequence represented by the formula A-B, wherein A represents all 
5 or the portion of the amino acid sequence designated by residues 24-193 of SEQ ID No:13; 
and B represents at least one amino acid residue of the amino acid sequence designated by 
residues 194-250 of SEQ ID No:13; wherein A and B together represent a contiguous 
polypeptide sequence designated in SEQ ID No:13, and the polypeptide modulates, e.g., 
mimics or antagonizes, the biological activity of a hedgehog protein. 

10 • In yet another preferred embodiment, the invention features a purified or recombinant 
polypeptide comprising an amino acid sequence represented by the formula A-B, wherein A 
represents all or the portion of the amino acid sequence designated by residues 25-193, or 
analogous residues thereof, of a vertebrate hedgehog polypeptide identical or homologous to 
§ SEQ ID No: 1 1 ; and B represents at least one amino acid residue of the amino acid sequence 

^ 15 designated by residues 194-250, or analogous residues Ihereof, of a vertebrate hedgehog 
polypeptide identical or homologous to SEQ ID No:l 1; wherein A and B together represent a 
contiguous polypeptide sequence designated in SEQ ID No:ll, and the polypeptide 
modulates, e.g., agonizes or antagonizes, the biological activity of a hedgehog protein. 

In another embodiment, the invention features a purified or recombinant polypeptide 
S 20 comprising an amino acid sequence represented by the formula A-B, wherein A represents all 
g or the portion of the amino acid sequence designated by residues 23-193 of SEQ ID No:9; 

and B represents at least one amino acid residue of the amino acid sequence designated by 
residues 194-250 of SEQ ID No:9; wherein A and B together represent a contiguous 
polypeptide sequence designated in SEQ ID No:9, and the polypeptide modulates, e.g., 
25 agonizes or antagonizes, the biological activity of a hedgehog protein. 

In yet another embodiment, the invention features a purified or recombinant 
polypeptide comprising an amino acid sequence represented by the formula A-B, wherein A 
represents all or the portion of the amino acid sequence designated by residues 28-197 of 
SEQ ID No:10; and B represents at least one amino acid residue of the amino acid sequence 
30 designated by residues 198-250 of SEQ ID No:10; wherein A and B together represent a 
contiguous polypeptide sequence designated in SEQ ID No:10, and the polypeptide 
modulates, e.g., agonizes or antagonizes, the biological activity of a hedgehog protein. 

In yet a further preferred embodiment, the invention features a purified or 
recombinant polypeptide comprising an amino acid sequence represented by the formula A- 
35 B, wherein A represents all or the portion of the amino acid sequence designated by residues 
1-98, or analogous residues thereof, of a vertebrate hedgehog polypeptide identical or 
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homologous to SEQ ID No:14; and B represents at least one amino acid residue of the amino 
acid sequence designated by residues 99-150, or analogous residues thereof, of a vertebrate 
hedgehog polypeptide identical or homologous to SEQ ID No:14; wherein A and B together 
represent a contiguous polypeptide sequence designated in SEQ ID No:14, and the 
polypeptide modulates, e.g., agonizes or antagonizes, the biological activity of a hedgehog 
protein. 

In another preferred embodiment, the invention features a nucleic acid encoding a 
polypeptide fragment of a hedgehog protein, e.g. a fragment described above. Preferably, the 
polypeptide fragment comprises an amino acid sequence identical or homologous with a 
sequence designated in one of SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:l 1, 
SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More preferably, the polypeptide 
fragment comprises an amino acid sequence designated in SEQ ID No:40. 

In yet another preferred embodiment, the invention features a nucleic acid encoding a 
§ polypeptide, which polypeptide has a molecular weight of approximately 1 9 kDa and has the 

p 15 ability to modulate, e.g., either mimic or antagonize, atleast a portion of the activity of a 
wild-type hedgehog protein. Preferably, the polypeptide comprises an amino acid sequence 
5 identical or homologous with a sequence designated in one of SEQ ID No:8, SEQ ID No:9, 

SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More 
preferably, the polypeptide comprises an amino acid sequence designated in the general 
20 formula SEQ ID No:40. 

In another preferred embodiment, the invention feature a nucleic acid which encodes 
a polypeptide that modulates, e.g., mimics or antagonizes, the biological activity of a 
hedgehog protein, which nucleic acid comprises all or a portion of the nucleotide sequence of 
the coding region of a gene identical or homologous to the nucleotide sequence designated by 
25 one of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, SEQ ID 
No:6 or SEQ ID No:7. Preferably, the nucleic acid comprises a hedgehog-encoding portion 
that hybridizes under stringent conditions to a coding portion of one or more of the nucleic • 
acids designated by SEQ ID No: 1-7. 

Moreover, as described below, the hedgehog polypeptide can be either an agonist 
30 (e.g. mimics), or alternatively, an antagonist of a biological activity of a naturally occurring 
form of the protein, e.g., the polypeptide is able to modulate difTerentiation and/or growth 
and/or survival of a cell responsive to authentic hedgehog proteins. Homologs of the subject 
hedgehog proteins include versions of the protein which are resistant to proteolytic cleavage, 
as for example, due to mutations which alter potential cleavage sequences or which inactivate 
35 an enzymatic activity associated with the protein. Other forms are secreted and isolatable 
from a cell with no further proteolytic cleavage required beyond cleavage of a signal 
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sequence, e.g., truncated forms of the protein, such as corresponding to the natural proteolytic 
fragments described below. 

The hedgehog polypeptides of the present invention can be glycosylated, or 
conversely, by choice of the expression system or by modification of the protein sequence to 
5 preclude glycosylation, reduced carbohydrate analogs can also be provided. Glycosylated 
forms include derivatization with glycosaminoglycan chains. Likewise, hedgehog 
polypeptides can be generated which lack an endogenous signal sequence (though this is 
typically cleaved off even if present in the pro-form of the protein). 

The subject proteins can also be provided as chimeric molecules, such as in the form 
10 of'fusion proteins. For instance, the hedgehog protein can be provided as a recombinant 
fusion protein which includes a second polypeptide portion, e.g., a second polypeptide 
having an amino acid sequence unrelated (heterologous) to the hedgehog polypeptide, e.g. 
the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide 
§ portion is an enzymatic activity such as alkaline phosphatase, e.g. the second polypeptide 

£H 1 5 portion is an epitope tag. 

M 

Jf! Yet another aspect of the present invention concerns an immunogen comprising a 

p hedgehog polypeptide in an immunogenic preparation, the immunogen being capable of 

iC eliciting an immune response specific for a hedgehog polypeptide; e.g. a humoral response, 
e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen 

Sj 20 comprising an antigenic determinant, e.g. a unique determinant, from a protein represented 

0 byoneofSEQIDNos.8-14. 

*0 A still further aspect of the present invention features antibodies and antibody 

^ preparations specifically reactive with an epitope of the hedgehog immunogen. 

In another preferred embodiment, the invention features a nucleic acid encoding a 
25 polypeptide fragment of a hedgehog protein, e.g. a fragment described above. Preferably, the 
polypeptide fragment comprises an amino acid- sequence identical or homologous with a 
sequence designated in one of SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID Norll, 
SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More preferably, the polypeptide 
fragment comprises an amino acid sequence designated in SEQ ID No:40. 

30 In yet another preferred embodiment, the invention features a nucleic acid encoding a 

polypeptide, which polypeptide has a molecular weight of approximately 19 kDa and has the 
ability to modulate, e.g., either mimic or antagonize, atleast a portion of the activity of a 
wild-type hedgehog protein. Preferably, the polypeptide comprises an amino acid sequence 
identical or homologous with a sequence designated in one of SEQ ID No:8, SEQ ID No:9, 

35 SEQ ID No:10, SEQ ID Norll, SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. More 
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preferably, the polypeptide comprises an amino acid sequence designated in the general 
formula SEQ ID No:40. 

In another preferred embodiment, the invention feature a nucleic acid which encodes 
a polypeptide that modulates, e.g., mimics or antagonizes, the biological activity of a 
- 5 Tiedgehog protein, which nucleic acid comprises all or a portion of the nucleotide sequence of 
'* the coding region of a gene identical or homologous to the nucleotide sequence designated by 

one of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, SEQ ID 
No:6 or SEQ ID No:7. Preferably, the nucleic acid comprises a hedgehog-encoding portion 
that hybridizes under stringent conditions to a coding portion of one or more of the nucleic 
10 acids designated by SEQ ID No: 1 -7. 

Another aspect of the present invention provides a substantially isolated nucleic acid 
having a nucleotide sequence which encodes a hedgehog polypeptide. In preferred 
embodiments, the encoded polypeptide specifically mimics or antagonizes inductive events 
j| mediated by wild-type hedgehog proteins. The coding sequence of the nucleic acid can 

p 15 comprise a sequence which is identical to a coding sequence represented in one of SEQ ID 
Nos: 1-7, or it can merely be homologous to one or more of those sequences. For instance, 
the hedgehog encoding sequence preferably has a sequence at least 60% homologous to a 
nucleotide sequence in one or more of SEQ ID Nos: 1-7, though higher sequence homologies 
of, for example, 80%, 90% or 95% are also contemplated. The polypeptide encoded by the 
C 20 nucleic acid can comprise an amino acid sequence represented in one of SEQ ID Nos: 8-14 
U such as one of those fall length proteins, or it can comprise a fragment of that nucleic acid, 

Pj which fragment may, for instance, encode a fragment which is, for example, at least 5, 10, 20, 

*3 50 or 100 or 200 amino acids in length. The polypeptide encoded by the nucleic acid can be 

either an agonist (e.g. mimics), or alternatively, an antagonist of a biological activity of a 
25 naturally occurring form of a hedgehog protein. 

Furthermore, in certain preferred embodiments, the subject hedgehog nucleic acid 
will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional 
promoter or transcriptional enhancer sequence, which regulatory sequence is operably linked 
to the hedgehog gene sequence. Such regulatory sequences can be used in to render the 
30 hedgehog gene sequence suitable for use as an expression vector. 

In yet a further preferred embodiment, the nucleic acid hybridizes under stringent 
conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of 
either sense or antisense sequence of one or more of SEQ ID Nos: 1 -7; though preferably to at 
least 20 consecutive nucleotides; and more preferably to at least 40, 50 or 75 consecutive 
35 nucleotides of either sense or antisense sequence of one or more of SEQ ID Nos:l-7. 
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The invention also features transgenic non-human animals, e.g. mice, rats, rabbits, 
chickens, frogs or pigs, having a transgene, e.g., animals which include (and preferably 
express) a heterologous form of a hedgehog gene described herein, or which misexpress an 
endogenous hedgehog gene, e.g., an animal in which expression of one or more of the subject 
'5 hedgehog proteins is disrupted. Such a transgenic animal can serve as an animal model for 
studying cellular and tissue disorders comprising mutated or mis-expressed hedgehog alleles 
or for use in drug screening. 

The invention also provides a probe/primer comprising a substantially purified 
oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence 
1 0~ which hybridizes under stringent conditions to at least 1 0 consecutive nucleotides of sense or 
antisense sequence of SEQ ID No:l, or naturally occurring mutants thereof. Nucleic acid 
probes which are specific for each of the classes of vertebrate hedgehog proteins are 
contemplated by the present invention, e.g. probes which can discern between nucleic acid 
_ encoding an Shh versus an Ihh versus a Dhh versus an Mhh. In preferred embodiments, the 

W 15 probe/primer further includes a label group attached thereto and able to be detected. The 
SJ label group can be selected, e.g., from a group consisting of radioisotopes, fluorescent 

i compounds, enzymes, and enzyme co-factors. Probes of the invention can be used as a part 

of a diagnostic test kit for identifying dysfunctions associated with mis-expression of a 
hedgehog protein, such as for detecting in a sample of cells isolated from a patient, a level of 
JL, 20 a nucleic acid encoding a subject hedgehog protein; e.g. measuring a hedgehog mRNA level 
in a cell, or determining whether a genomic hedgehog gene has been mutated or deleted. 
These so called "probes/primers" of the invention can also be used as a part of "antisense" 
therapy which refers to administration or in situ generation of oligonucleotide probes or their 
S derivatives which specifically hybridize (e.g. bind) under cellular conditions, with the cellular 

25 mRNA and/or genomic DNA encoding one or more of the subject hedgehog proteins so as to 
inhibit expression of that protein, e.g. by inhibiting transcription and/or translation. 
Preferably, the oligonucleotide is at least 10 nucleotides in length, though primers of 20, 30, 
50, 100, or 1 50 nucleotides in length are also contemplated. 

In yet another aspect, the invention provides an assay for screening test compounds 
30 for inhibitors, or alternatively, potentiators, of an interaction between a hedgehog protein and 
a hedgehog receptor. An exemplary method includes the steps of (a) forming a reaction 
niixture mcluding: (i) a AcoTge/iog polypeptide, (ii) a pafc/rerf polypeptide, and (iii) a test 
compound; and (b) detecting interaction of the hedgehog and patched polypeptides. A 
statistically significant change (potentiation or inhibition) in the interaction of the hedgehog 
35 and patched polypeptides in the presence of the test compound, relative to the interaction in 
the absence of the test compound, indicates a potential agonist (mimetic or potentiator) or 
antagonist (inhibitor) of hedgehog bioactivity for the test compound. The reaction mixture 
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can be a cell-free protein preparation, e.g., a reconsistuted protein mixture or a cell lysate, or 
it can be a recombinant cell including a heterologous nucleic acid recombinantly expressing 
the patched polypeptide. 

In preferred embodiments, the step of detecting interaction of the hedgehog and 
5 patched polypeptides is a competitive binding assay. In other preferred embodiments, the 
step of detecting interaction of the hedgehog and patched polypeptides involves detecting, in 
a cell-based assay, change(s) in the level of an intracellular second messenger responsive to 
signaling by the patched polypeptide. In still another preferred embodiment, the step of 
detecting interaction of the hedgehog and patched polypeptides comprises detecting, in a 
10 cell-based assay, change(s) in the level of expression of a gene controlled by a transcriptional 
regulatory sequence responsive to signaling by the patched polypeptide. 

In one exemplary embodiment, the present invention provides an assay for screening 
test compounds to identify agents which modulate the binding of hedgehog proteins with a 
P hedgehog receptor, comprising: (i) combining, as a cell-free system, a hedgehog polypeptide, 

I 15 a hedgehog receptor polypeptide, and a test compound; and (ii) detecting formation of a 
5 complex comprising the hedgehog and receptor polypeptides. A statistically significant 

i change in the formation of the complex in the presence of the test compound is indicative of 

S an agent that modulates interaction between hedgehog proteins and a cognate hedgehog 

5 receptor. The cell-free system can be, e.g., a cell membrane preparation, a reconstituted 
^ 20 protein mixture, or a Uposome reconstituting the receptor polypeptide as a hedgehog receptor. 
*j In preferred embodiments, at least one of the hedgehog polypeptide and the receptor 
g polypeptide comprises a detectable label, and interaction of the hedgehog and receptor 
| polypeptides is quantified by detecting the label in the complex. The detectable label can be, 

6 e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. In other 
25 embodiments, the complex is detected by an immunoassay. In preferred embodiments, the 

receptor is a patched polypeptide. In preferred embodiments, the above assay further 
includes the step of contacting the compound, which produced statistically significant change 
in the formation of the complex, with a cell expressing a hedgehog receptor and detennining 
if the compound can cause a phenotypic change in the cell. 
' 30 Yet another exemplary embodiment provides an assay for screening test compounds 

to identify agents which modulate the binding of hedgehog proteins with a hedgehog 
receptor, e.g., a patched receptor, comprising: (i) providing a cell expressing a hedgehog 
receptor; (ii) contacting the cell with a hedgehog polypeptide and a test compound; and (Hi) 
detecting interaction of the hedgehog polypeptide and receptor. A statistically significant 
35 change in the level of interaction of the hedgehog polypeptide and receptor is indicative of an 
agent that modulates the interaction of hedgehog proteins with a hedgehog receptor. The 
interaction of the hedgehog polypeptide and receptor can be detected, e.g., by detecting 
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change in phenotype of the cell relative to the absence of the test compound. The change is 
phenotype may be, to illustrate, a gain or loss of expression of a cell-type specific marker. 

In other embodiments, the receptor transduces a signal in the cell which is sensitive to 
hedgehog binding, and the cell further comprises a reporter gene construct comprising a 
5 reporter gene in operable linkage with a transcriptional regulatory sequence sensitive to 
intracellular signals transduced by interaction of the hedgehog polypeptide and receptor, 
expression of the reporter gene providing a detectable signal for detecting interaction of the 
hedgehog polypeptide and receptor. The reporter gene can encode, e.g., a gene product that 
gives rise to a detectable signal such as: color, fluorescence, luminescence, cell viability 
10 relief of a cell nutritional requirement, cell growth, and drug resistance. For example, the 
reporter gene can encode a gene product selected from the group consisting of 
chloramphenicol acetyl transferase, luciferase, beta-galactosidase and alkaline phosphatase. 

In preferred embodiments, the transcriptional regulatory sequence which controls 
O expression of the reporter gene is from a GLI gene and patched gene. 

P 15 In still other embodiments, the receptor transduces a signal in the cell which is 

N sensitive to hedgehog binding, and interaction of the hedgehog polypeptide and receptor are 

j§. detected by detecting change in the level of an intracellular second messenger responsive to 
P signaling by the receptor. For example, interaction of the hedgehog polypeptide and receptor 

* can be detected by changes in intracellular protein phosphorylation. 

^20 In preferred embodiments of the cell-based assay formats, the cell includes a 

n heterologous gene construct encoding the receptor. Moreover, the cell can also include one 

ffl or more heterologous gene constructs encoding, e.g., costal-2, fused and/or smoothened 

|p genes, or homologs thereof. 

In preferred embodiments, the patched polypeptide is a drosophila/wrfctec? protein, or 
25 a vertebrate homolog thereof. In more preferred embodiments, the patched protein is of 
mammaUan origin, e.g., the patched polypeptide is human patched protein. Moreover, the 
the patched polypeptide can be a recombinant polypeptide. 

The hedgehog polypeptide used in the assays of the instant invention is also 
preferably of vertebrate origin, e.g., of mammalian origin, e.g., the hedgehog polypeptide is 
30 human hedgehog protein. The hedgehog polypeptide is preferably a recombinant 
polypeptide. 

For the cell-based assays, the recombinant cell is preferably a metazoan cell, e.g., a 
mammalian cell, e.g., an insect cell, e.g., a xenopus cell, e.g., an oocyte. In other 
embodiments, the hedgehog receptor can be reconsituted in a yeast cell. 
35 In preferred embodiments, the steps of the assay are repeated for a variegated library 

of at least 100 different test compounds, more preferably at least 1Q3, 10* or 10* different test 
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compounds. The test compound can be, e.g., a peptide, a nucleic acid, a carbohydrate, a 
small organic molecule, or natural product extract (or fraction thereof). 

Another aspect of the present invention provides a recombinant cell, e.g., for carrying 
out certain of the drug screening methods above, comprising: (i) an expressible recombinant 
5 gene encoding a heterologous patched polypeptide whose signal transduction activity is 
modulated by binding to a hedgehog protein; and (ii) a reporter gene construct containing a 
reporter gene in operative linkage with one or more transcriptional regulatory elements 
responsive to the signal transduction activity of the cell patched polypeptide. Still another 
aspect of the present invention provides a kit for screening test compounds to identify 
1 0 agents which modulate the binding of hedgehog proteins with a hedgehog receptor, including 
the above-referenced cell and a preparation of purified hedgehog polypeptide. 

In still another aspect, the present invention provides an assay for identifying 
compounds which inhibit the proteolytic activity of a hedgehog protein, comprising: (a) 
q forming a reaction mixture including: (i) a hedgehog protein having an endogenous 

0 15 proteolytic activity, (ii) a substrate for the hedgehog proteolytic activity, and (iii) a test 
% compound; and (b) determining the rate of conversion of the substrate to product by the 
£ hedgehog proteolytic activity. A statistically significant decrease in the rate of substrate 
SJ conversion in the presence of the test compound, relative to the absence of the test 
Jo compound, indicates a that the test compound is an inhibitor of the proteolytic activity of the 

* 20 hedgehog protein. 

P 

H yet another aspect of the present invention concerns a method for modulating one or 

jy more of growth, differentiation, or survival of a mammalian cell responsive to hedgehog 

iD induction. In general, whether carries out in vivo, in vitro, or in situ, the method comprises 

01 treating the cell with an effective amount of a hedgehog polypeptide so as to alter, relative to 
25 the cell in the absence of hedgehog treatment, at least one of (i) rate of growth, (ii) 

differentiation, or (iii) survival of the cell. Accordingly, the method can be carried out with 
polypeptides mimics the effects of a naturally-occurring hedgehog protein on the cell, as well 
as with polypeptides which antagonize the effects of a naturally-occurring hedgehog protein 
on said cell. In preferred embodiments, the hedgehog polypeptide provided in the subject 

30 method are derived from verterbrate sources, e.g., are vertebrate hedgehog polypeptides. For 
instance, preferred polypeptides includes an amino acid sequence identical or homologous to 
an amino acid sequence (e.g., including bioactive fragments) designated in one of SEQ ID 
No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13 or SEQ 
ID No: 14. Furthermore, the present invention contemplates the use of invertebrate hedgehog 

35 polypeptides, such as the Dros-HH polypeptide designated by SEQ ID No:34, or bioactive 
fragments thereof equivalent to the subject vertebrate fragments. 
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In one embodiment, the subject method includes the treatment of testicular cells, so as 
modulate spermatogenesis. In another embodiment, the subject method is used to modulate 
osteogenesis, comprising the treatment of osteogenic cells with a hedgehog polypeptide. 
Liekwise, where the treated cell is a chondrogenic cell, the present method is used to 
5 modulate chondrogenesis. In still another embodiment, hedgehog polypeptides can be used 
to modulate the differentiation of neural cells, e.g., the method can be used to cause 
differentiation of a neuronal cell, to maintain a neuronal cell in a differentiated state, and/or 
to enhance the survival of a neuronal cell, e.g., to prevent apoptosis or other forms of cell 
death. For instance, the present method can be used to affect the differentiation of such 
10 neuronal cells as motor neurons, cholinergic neurons, dopanergic neurons, serotenergic 
neurons, and peptidergic neurons. 

The present method is applicable, for example, to cell culture technique, such as in 
the culturing of neural and other cells whose survival or differentiative state is dependent on 
hedgehog function. Moreover, hedgehog agonists and antagonists can be used for therapeutic 
ffl 15 intervention, such as to enhance survival and maintenance of neurons and other neural cells 
JH in both the central nervous system and the peripheral nervous system, as well as to influence 

£ other vertebrate organogenic pathways, such as other ectodermal patterning, as well as 

^ certain mesodermal and endodermal differentiation processes. In an exemplary embodiment, 

Jp the method is practiced for modulating, in an animal, cell growth, cell differentiation or cell 

^ 20 survival, and comprises administering a therapeutically effective amount of a hedgehog 
polypeptide to alter, relative the absence of hedgehog treatment, at least one of (i) rate of 
growth, (ii) differentiation, or (iii) survival of one or more cell-types in the animal. 



is Another aspect of the present invention provides a method of determining if a subject, 

^ e.g. a human patient, is at risk for a disorder characterized by unwanted cell proliferation or 

25 aberrant control of differentiation. The method includes detecting, in a tissue of the subject, 
the presence or absence of a genetic lesion characterized by at least one of (i) a mutation of a 
gene encoding a hedgehog protein, e.g. represented in SEQ ID No: 2, or a homolog thereof; 
or (ii) the mis-expression of a hedgehog gene. In preferred embodiments, detecting the 
genetic lesion includes ascertaining the existence of at least one of: a deletion of one or more 
30 nucleotides from a hedgehog gene; an addition of one or more nucleotides to the gene, a 
substitution of one or more nucleotides of the gene, a gross chromosomal rearrangement of 
the gene; an alteration in the level of a messenger RNA transcript of the gene; the presence of 
a non-wild type splicing pattern of a messenger RNA transcript of the gene; or a non-wild 
type level of the protein. 

35 For example, detecting the genetic lesion can include (i) providing a probe/primer 

including an oligonucleotide containing a region of nucleotide sequence which hybridizes to 
a sense or antisense sequence of a hedgehog gene, e.g. a nucleic acid represented in one of 
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SEQ ID Nos: 1-7, or naturally occurring mutants thereof, or 5' or 3' flanking sequences 
naturally associated with the hedgehog gene; (ii) exposing the probe/primer to nucleic acid of 
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the 
presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises 
5 utilizing the probe/primer to determine the nucleotide sequence of the hedgehog gene and, 
optionally, of the flanking nucleic acid sequences. For instance, the probe/primer can be 
employed in a polymerase chain reaction (PCR) or in a ligation chain reaction (LCR). In 
alternate embodiments, the level of a hedgehog protein is detected in an immunoassay using 
an antibody which is specifically immunoreactive with the hedgehog protein. 

10 The practice of the present invention will employ, unless otherwise indicated, 

conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, 
microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See, for example, Molecular Cloning A 
n Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 

PS 15 Laboratory Press: 1989); DNA Cloning, Volumes I and U (D. N.. Glover ed., 1985); 
frj Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; 

% Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 

£ Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. 

| Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. 

t 20 Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology 
P, (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and 

p M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 

IU and 1 55 (Wu et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer 

$ and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, 

' 25 Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. 



30 Brief Description of the Drawings 

Figure 1 represents the amino acid sequences of two chick hh clones, chicken 
hedgehog-A (pCHA; SEQ ID No:35) and chicken hedgehog-B (pCHB; SEQ ID No:36). 
These clones were obtained using degenerate primers corresponding to the underlined amino 
acid residues of the Drosophila sequence (corresponding to residues 161-232 of SEQ ID 
35 No:34) also shown in Figure 1 , followed by nested PCR using chicken genomic DNA. 
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Figure 2 is an alignment comparing the amino acid sequences of chick Shh (SEQ ID 
No:8) with its Drosophila homolog (SEQ ID No:34). Shh residues 1-26 correspond to the 
proposed signal peptide. Identical residues are enclosed by boxes and gaps in order to 
highlight similarity. The nucleotide sequence of Shh has been submitted to Genbank. 

5 Figure 3 is a hydropathy plot for the predicted chick Shh protein, generated by the 

methods of Kyte and Doolittle (1982). The values of hydrophobicity are plotted against the 
amino acid positions. Negative values predict a hydrophobic domain of the protein. 

Figure 4 is an alignment comparing the amino acid sequences of various hh proteins. 
The white region on the amino terminus of chicken Shh corresponds to the putative signal 
10 peptide. The black box refers to a highly conserved region from aa residues 26-207 of SEQ 
ID No:8). The arrows point to exon boundaries in the Drosophila gene (Lee et al. (1992) Cell 
71: 33-50). In each case, the proteins are compared to chicken Shh (SEQ ID No:8) and the 
percent amino acid identity is indicated in each region's box. 

Q 

p5 Figure 5A is a "pileup" alignment of predicted amino acid sequences which compares 

J! 15 Drosophila hh (D-hh; SEQ ID No:34), mouse hh (M^Dhh; SEQ ID No:9; M-Ihh; SEQ ID 
| No:10; M-Shh; SEQ ID No:l 1), chicken hh (C-Shh; SEQ ID No:8), and zebrafish hh {Z-Shh; 

V* SEQ ID No:12). The predicted hydrophobic transmembrane/signal sequences are indicated 

§ in italics and the predicted signal sequence processing site is arrowed. The positions of 

r introns interrupting the Drosophila hh and U-Dhh open reading frames are indicated by 

8 20 arrowheads. All amino acids shared among the six predicted hh proteins are indicated in 
Q bold. Figure 5B is a sequence alignment of the N-terminal portion of vertebrate hedgehog 

^ proteins, and the predicted degenerate sequence "CON" (SEQ ID No: 41). 

A* Figure 6 is an inter- and cross-species comparison of amino acid identities among the 

predicted processed hh proteins shown in Figure 5A. All values are percentages. Figures in 
25 parentheses represent sirnilarities allowing for conservative amino acid substitutions. 

Figure 7 is a representation of the DNA constructs used in transgenic studies to study 
ectopic expression of chick Shh in mouse embryos. Constructs were generated for ectopic 
expression of cDNA clones in the Wnt-l expression domain and tested in transgenic mice 
embryos using a lac-Z reporter (pWEXP-tocZ (used as a control)) and a chick Shh reporter 
30 (pWEXP-CShh). The pWEXP-CShh construct contained two tandem head to tail copies of a 
chick Shh cDNA. The results of WEXP2-CShh transgenic studies are shown in Table 1 . 

Figure 8 is a model for anterioposterior limb patterning and the Zone of Polarizing 
Activity (ZPA), based on Saunders and Gasseling (1968). The left portion of the diagram 
schematizes a stage 20 limb bud. The somites are illustrated as blocks along the left margin 
35 of the limb bud; right portion of the same panel illustrates the mature wing. The hatched 
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region on the posterior limb is the ZPA. Normally, the developed wing contains three digits 
II, III, and IV. The figure further shows the result of transplanting a ZPA from one limb bud 
to the anterior margin of another. The mature limb now contains six digits IV, III, II, n, HI, 
and IV in a mirror-image duplication of the normal pattern. The large arrows in both panels 
5 represent the signal produced by the ZPA which acts to specify digit identity. 

Figures 9A and 9B illustrate the comparison of zebrafish Shh (Z-Shh) and Drosophila 
hh (hh) amino acid sequences. Figure 9A is an alignment of zebrafish Shh and Drosophila hh 
amino acid sequences. Identical amino acids are linked by vertical bars. Dots indicate gaps 
introduced for optimal alignment Putative transmembrane/signal peptide sequences are 
10 underlined (Kyte and Doolittle (1982) J Mol Biol 157:133-148). The position of exon 
boundaries in the Drosophila gene are indicated by arrowheads. The region of highest 
similarity between Z-Shh and hh overlaps exon 2. Figure 9B is a schematic comparison of Z- 
Shh and drosophila hh. Black boxes indicate the position of the putative 
transmembrane/signal peptide sequences, relative to the ammo-terminus. Sequence 
p 15 homologies were scored by taking into account the alignment of chemically similar amino 
|| acids and percentage of homology in the boxed regions is indicated. 

Figure 10 is an alignment of partial predicted amino acid sequences from three 
ih different zebrafish hh homologs. One of these sequences corresponds to Shh, while the other 

® two define additional hh homologs in zebrafish, named hh(a) and hhQ>). Amino acid 

? 20 identities among the three partial homologs are indicated by vertical bars. 

a 

M Figure 11 is a schematic representations of chick and mouse Shh proteins. The 



putative signal peptides and Asn-linked glycosylation sites are shown. The numbers refer to 
jg amino acid positions. 

Figure 12 is a schematic representation of myc-tagged Shh constructs. The positions 
25 of the c-myc epitope tags are shown, as is the predicted position of the proteolytic cleavage 
site. The shaded area following the signal peptide of the carboxy terminal tagged construct 
represents the region included in the Glutathione~S-transferase fusion protein used to generate 
antisera in rabbits. 

Figure 13 is a schematic diagram of Shh processing. Illustrated are cleavage of the 
30 signal peptide (black box), glycosylation at the predicted Asn residue (N), and the secondary 
proteolytic cleavage. The question marks indicate that the precise site of proteolytic cleavage 
has not been determined. The different symbols representing the carbohydrate moiety 
indicated maturation of this structure in the Golgi apparatus. The dashed arrow leading from 
the signal peptide cleaved protein indicates that secretion of this species may be an artifact of 
35 the incomplete proteolytic processing of Shh seen in Xenopus oocytes and cos cells. 
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Figure 14 is a schematic diagram of a model for the coordinated growth and 
patterning of the limb. Sonic is proposed to signal directly to the mesoderm to induce 
expression of the Hoxd and Bmp-2 genes. The induction of these mesodermal genes requires 
5 competence signals from the overlying AER. One such signal is apparently Fgf-4. 
Expression of Fgf-4 in the AER can be induced by Sonic providing an indirect signaling 
pathway from Sonic to the mesoderm. FGFs also maintain expression of Sonic in the ZPA, 
thereby completing a positive feedback loop which controls the relative positions of the 
signaling centers. While Fgf-4 provides competence signals to the mesoderm, it also 
10 promotes mesodermal proliferation. Thus patterning of the mesoderm is dependent on the 
same signals which promote its proliferation. This mechanism inextricably integrates limb 
patterning with outgrowth. 

Figures 15A and B are schematic diagrams of patterning of the Drosophila and 
15 vertebrate gut. Regulatory interactions responsible for patterning of Drosophila midgut (Fig. 
p 15 A) are compared to a model for patterning of the vertebrate hindgut (Fig. 15B) based on 

expression data. Morphologic regional distinctions are indicated to the left (A and B), genes 
J expressed in the visceral mesoderm are in the center panel, those in the gut lumenal 

£j endoderm are on the right. HOMIHox gene expression domains are boxed. Regionally 

*B 20 expressing secreted gene products are indicated by lines. Arrows indicate activating 
jp interactions, barred lines, inhibiting interactions. Regulatory interactions in Drosophila gut 

v 4 (A) have been established by genetic studies except for the relationship between dpp and 

jjj: hedgehog, which is hypothesized based on their interactions in the Drosophila imaginal 

discs, hedgehog appears to be a signal from the endoderm to the mesoderm, and that dpp is 
P 25 expressed in the mesoderm. 

Figure 16 is a schematic diagram of chromosomal locations of Ihh, Shh and Dhh in 
the mouse genome. The loci were mapped by interspecific backcross analysis. The 
segregation patterns of the loci and flanking genes in backcross animals that were typed for 

30 all loci are shown above the chromosome maps. For individual pairs of loci more animals 
were typed. Each column represents the chromosome identified in the backcross progeny 
that was inherited from the (C57BL/6J x M. spretus) Fl parent. The shaded boxes represent 
the presence of a C57BL/6J allele and white boxes represent the presence of a M. spretus 
allele. The number of the offsprings inheriting each type of chromosome is listed at the 

35 bottom of each column. Partial chromosome linkage maps showing location of Ihh, Shh and 
Dhh in relation too linked genes is shown. The number of recombinant N2 animals is 
presented over total number of N 2 animals typed to the left of the chromosome maps between 
each pair of loci. The recombinant frequencies, expressed as genetic distance in 
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centimorgans (+ one standard error) are also shown. When no recombination between loci 
was detected, the upper 95% confidence limit of the recombination distance is indicated in 
parentheses. Gene order was determined by minimizing the number of recombinant events 
required to explain the allele distribution patterns. The position of loci in human 
chromosomes can be obtained from GDB (Genome Data Base), a computerized database of 
human linkage information maintained by the William H. Welch Medical Library of the John 
Hopkins University (Baltimore, MD). 

Figure 17 is a graph depicting saturable, concentration-dependent binding of human 
Shh to chick patched expressed in Xenopus laevis oocytes . 

Figure 18 is a bar graph depicting the specificity of human Shh binding to the chick 
patched. 

Figure 19 is a graph depicting the timecourse of human Shh binding to chick patched. 

Figure 20 is a graph depicting the dissociation rate of human Shh binding to chick 
patched. 

Figure 21 is a bar graph depicting the effect of glycosylation on the binding of Shh to 
patched. 

Figure 22 is a schematic diagram of a proposed topological model of the mouse 
patched protein. (Goodrich et aL (1996), Genes & Development 10(3): 301-10) 

Detailed Description of the Invention 

Of particular importance in the development and maintenance of tissue in vertebrate 
animals is a type of extracellular communication called induction, which occurs between 
neighboring cell layers and tissues (Saxen et aL (1989) IntJDev Biol 33:21-48; and Gurdon 
et aL (1987) Development 99:285-306). In inductive interactions, chemical signals secreted 
by one cell population influence the developmental fate of a second cell population. 
Typically, cells responding to the inductive signals are diverted from one cell fate to another, 
neither of which is the same as the fate of the signaling cells. 

Inductive signals are key regulatory proteins that function in vertebrate pattern 
formation, and are present in important signaling centers known to operatex embryonically, 
for example, to define the organization of the vertebrate embryo. For example, these 
signaling structures include the notochord, a transient structure which initiates the formation 
of the nervous system and helps to define the different types of neurons within it. The 
notochord also regulates mesodermal patterning along the body axis. Another distinct group 
of cells having apparent signaling activity is the floorplate of the neural tube (the precursor of 
the spinal cord and brain) which also signals the differentiation of different nerve cell types. 
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It is also generally believed that the region of mesoderm at the bottom of the buds which 
form the limbs (called the Zone of Polarizing Activity or ZPA) operates as a signaling center 
by secreting a morphogen which ultimately produces the correct patterning of the developing 
limbs. 



The present invention concerns the discovery that polypeptides encoded by a family 
of vertebrate genes, termed here hedgehog genes, comprise the signals produced by these 
embryonic patterning centers. As described herein, each of the disclosed vertebrate hedgehog 
(hh) homologs exhibits spatially and temporally restricted expression domains indicative of 
important roles in embryonic patterning. For instance, the results provided below indicate 
10 that vertebrate hh genes are expressed in the posterior limb bud, Hensen's node, the early 
notochord, the floor plate of the neural tube, the fore- and hindgut and their derivatives, 
j These are all important signaling centers known to be required for proper patterning of 
surrounding embryonic tissues. 

! 

Cj The hedgehog family of vertebrate inter-cellular signaling molecules provided by the 

^ 15 present invention consists of at least four members. Three of these members, herein referred 
Si to as Desert hedgehog (Dhh), Sonic hedgehog (Shh) and Indian hedgehog (Ihh), apparently 

Jfj exist in all vertebrates, including fish, birds, and mammals. A fourth member, herein referred 

□ to as Moonrat hedgehog (Mhh) 9 appears specific to fish. According to the appended sequence 

^ listing, (see also Table 1) a chicken Shh polypeptide is encoded by SEQ ID No:l; a mouse 

p 20 Dhh polypeptide is encoded by SEQ ID No:2; a mouse Ihh polypeptide is encoded by SEQ 
Sj ID No:3; a mouse Shh polypeptide is encoded by SEQ ID No:4 a zebrafish Shh polypeptide is 

^ encoded by SEQ ID No:5; a human Shh polypeptide is encoded by SEQ ID No:6; and a 

5 human Ihh polypeptide is encoded by SEQ ID No:7. 



25 Table 1 

Guide to hedgehog sequences in Sequence Listing 

Nucleotide Amino Acid 



Chicken Shh SEQ ID No. 1 SEQ ID No. 8 

Mouse Dhh SEQ ID No. 2 SEQ ID No. 9 

Mouse Ihh SEQ ID No. 3 SEQ ID No. 10 

Mouse Shh SEQ ID No. 4 SEQ ID No. 1 1 

Zebrafish Shh SEQ ID No. 5 SEQ ID No. 12 

Human Shh SEQ ID No. 6 SEQ ID No. 13 

Human Ihh SEQ ID No. 7 SEQ ID No. 14 

Certain of the vertebrate hedgehog (hh) proteins of the present invention are defined 
by SEQ ID Nos:8-14 and can be cloned from vertebrate organisms including fish, avian and 
30 mammalian sources. These proteins are distinct from the drosophila hedgehog protein 
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which, for clarity, will be referred to hereinafter as "Dros-HH". In addition to the sequence 
variation between the various hh homologs, the vertebrate hedgehog proteins are apparently 
present naturally in a number of different forms, including a pro-form, a full-length mature 
form, and several processed fragments thereof. The pro-form includes an N-terminal signal 
5 peptide for directed secretion of the extracellular domain, while the full-length mature form 

(lacks this signal sequence. Further processing of the mature form apparently occurs in some 
instances to yield biologically active fragments of the protein. For instance, sonic hedgehog 
undergoes additional proteolytic processing to yield two peptides of approximately 19 kDa 
and 27 kDa, both of which are secreted. In addition to proteolytic fragmentation, the 
10 vertebrate hedgehog proteins can also be modified post-translationally, such as by 
glycosylation, though bacterially produced (e.g. unglycosylated) forms of the proteins 
apparently still maintain some of the activity of the native protein. 

t As described in the following examples, the cDNA clones provided by the present 

p invention were first obtained by screening a mouse genomic library with a partial Drosophila 

p 15 hh cDNA clone (.7kb). Positive plaques were identified and one mouse clone was selected. 
f\ This clone was then used as a probe to obtain a genomic clone containing the full coding 
sequence of the Mouse Dhh gene. As described in the attached Examples, Northern blots and 
Ifl in situ hybridization demonstrated that Mouse Dhh is expressed in the testes, and potentially 

the ovaries, and is also associated with sensory neurons of the head and trunk. Interestingly, 
20 no expression was detected on the nerve cell bodies themselves (only the axons), indicating 
that Dhh is likely produced by the Shwann cells. 



> — 5 

In order to obtain cDNA clones encoding chicken hh genes, degenerate 

oligonucleotides were designed corresponding to the amino and carboxy ends of Drosophila 
^ hh exon 2. As described in the Examples below, these oligonucleotides were used to isolate 

I 25 PCR fragments from chicken genomic DNA. These fragments were then cloned and 

sequenced. Ten clones yielded two different hh homologs, chicken Dhh and chicken Shh. 

The chicken Shh clone was then used to screen a stage 21/22 limb bud cDNA library which 

yielded a full length Shh clone. 

In order to identify other vertebrate hedgehog homologs, the chicken clones {Dhh and 
30 Shh) were used to probe a genomic southern blot containing chicken DNA. As described 
below, genomic DNA was cut with various enzymes which do not cleave within the probe 
sequences. The DNA was run on a gel and transferred to a nylon filter. Probes were derived 
by ligating each 220 bp clone into a concatomer and then labeling with a random primer kit. 
The blots were hybridized and washed at low stringency. In each case, three hybridizing 
35 bands were observed following autoradiography, one of which was significantly more intense 
(a different band with each probe), indicating that there are at least three vertebrate hh genes. 
Additional cDNA and genomic screens carried out have yielded clones of three hh homologs 
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from chickens and mice (Shh, Dhh and Ihh\ and four hh homologs from zebrafish (Shh, Dhh, 
Ihh and Mhh\ Weaker hybridization signals suggested that the gene family may be even 
larger. Moreover, a number of weakly hybridizing genomic clones have been isolated. 
Subsequently, the same probes derived from chicken hedgehog homologs have been utilized 
5 to screen a human genomic library. PCR fragments derived from the human genomic library 
were then sequenced, and PCR probes derived from the human sequences were used to screen 
human fetal cDNA libraries. Full-length cDNA encoding human sonic hedgehog protein 
(Shh) and partial cDNA encoding human Indian hedgehog protein (Ihh) were isolated from 
the fetal library, and represent a source of recombinant human hedgehog proteins. 

10 To order to determine the expression patterns of the various vertebrate hh homologs, 

in situ hybridizations were performed in developing embryos of chicken, mice and fish. As 
described in the Examples below, the resulting expression patterns of each hh homolog were 
similar across each species and revealed that hh genes are expressed in a number of important 
embryonic signaling centers. For example, Shh is expressed in Hensen's node, the notochord, 
3 1 5 the ventral floorplate of the developing neural tube, and the ZPA at the base of the limb buds; 
Cf| Ihh is expressed in the embryonic yolksac and hindgut, and appear also to be involved in 

j« chondrogenesis; Dhh is expressed in the testes; and Mhh (only in zebrafish) is expressed in 

U1 the notochord and in certain cranial nerves. 

a 

Furthermore, experimental evidence indicates that certain hedgehog proteins initiate 
ff% 20 expression of secondary signaling molecules, including Bmp-2 (a TGF-p relative) in the 

mesoderm and Fgf-4 in the ectoderm. The mesoderm requires ectodermally-derived 
jy competence factors), which include Fgf-4, to activate target gene expression in response to 

hedgehog signaling. The expression of, for example, Sonic and Fgf-4 is coordinately 
9* regulated by a positive feedback loop operating between the posterior mesoderm and the 

25 overlying AER, which is the ridge of pseudostratified epithelium extending antero-posteriorly 
* along the distal margin of the bud. These data provide a basis for understanding the 

r * integration of growth and patterning in the developing limb which can have important 

implications in the treatment of bone disorders described in greater detail herein. 

i 

To determine the role hedgehog proteins plays in inductive interactions between the 
30 endoderm and mesoderm, which are critical to gut morphogenesis, in situ hybridizations and 
recombinant retroviral injections were performed in developing chick embryos. The ventral 
mesoderm is induced to undergo gut-specific differentiation by the adjacent endoderm. As 
described in Examples below, at the earliest stages of chick gut formation Shh is expressed by 
the endoderm, and BMP-4 (a TGF-P relative) is expressed in the adjacent visceral mesoderm. 
35 Ectopic expression of Sonic is sufficient to induce expression of BMP-4 in visceral 
mesoderm, suggesting that Sonic serves as an inductive signal from the endoderm to the 
mesoderm. Subsequent organ-specific endodermal differentiation depends on regional 
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inductive signal from the visceral mesoderm. Hox genes are expressed in the undifferentiated 
chick hind gut mesoderm with boundaries corresponding to morphologic borders, suggesting 
a role in regulating gut morphogenesis. 

Bioactive fragments of hedgehog polypeptides of the present invention have been 
generated and are described in great detail in USSN 08/435,093, filed May 4 1995, herein 
incorporated by reference. 

Accordingly, certain aspects of the present invention relate to nucleic acids encoding 
vertebrate hedgehog proteins, the hedgehog proteins themselves, antibodies immunoreactive 
with hh proteins, and preparations of such compositions. Moreover, the present invention 
provides diagnostic and therapeutic assays and reagents for detecting and treating disorders 
involving, for example, aberrant expression of vertebrate hedgehog homology In addition, 
drug discovery assays are provided for identifying agents which can modulate the binding of 
vertebrate hedgehog homologues to hedgehog-binding moieties (such as hedgehog receptors, 
ligands, or other extracellular matrix components). Such agents can be useful therapeutically 
to alter the growth and/or differentiation of a cell. Other aspects of the invention are 
described below or will be apparent to those skilled in the art in light of the present 
disclosure. 

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term 
should also be understood to include, as equivalents, analogs of either RNA or DNA made 
from nucleotide analogs, and, as applicable to the embodiment being described, single (sense 
or antisense) and double-stranded polynucleotides. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
comprising an open reading frame encoding one of the vertebrate hh polypeptides of the 
present invention, including both exon and (optionally) intron sequences. A "recombinant 
gene" refers to nucleic acid encoding a vertebrate hh polypeptide and comprising vertebrate 
A/i-encoding exon sequences, though it may optionally include intron sequences which are 
either derived from a chromosomal vertebrate hh gene or from an unrelated chromosomal 
gene. Exemplary recombinant genes encoding the subject vertebrate hh polypeptide are 
represented by SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, 
SEQ ID No:6 or SEQ ID No:7. The term "intron" refers to a DNA sequence present in a 
given vertebrate hh gene which is not translated into protein and is generally found between 
exons. 
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As used herein, the term "transfection" means the introduction of a nucleic acid, 
e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 
"Transformation", as used herein, refers to a process in which a cell's genotype is changed as 
a result of the cellular uptake of exogenous DNA or RNA, and, for example, the transformed 
5 cell expresses a recombinant form of a vertebrate hh polypeptide or, where anti-sense 
expression occurs from the transferred gene, the expression of a naturally-occurring form of 
the vertebrate hh protein is disrupted. 

As used herein the term "bioactive fragment of a hedgehog protein" refers to a 
fragment of a hedgehog polypeptide, wherein the encoded polypeptide specifically agonizes 
10 or antagonizes inductive events mediated by wild-type hedgehog proteins. The hedgehog 
: \ biactive fragment preferably is, for example, at least 5, 10, 20, 50, 100, 150 or-200 amino 
; acids in length. 

* An "effective amount" of a hedgehog polypeptide, or a bioactive fragment thereof, 
O with respect to the subject method of treatment, refers to an amount of agonist or antagonist 
ft 15 in a preparation which, when applied as part of a desired dosage regimen, provides 
N modulation of growth, differentiation or survival of cells, e.g., modulation of 
^ spermatogenesis, skeletogenesis, e.g., osteogenesis, chondrogenesis, or limb patterning, or 

Q neuronal differentiation. 

y3 

* As used herein, "phenotype" refers to the entire physical, biochemical, and 
^ 20 physiological makeup of a cell, e.g., having any one trait or any group of traits. 

^ The tenns "induction" or "induce", as relating to the biological activity of a hedgehog 

i3 protein, refers generally to the process or act of causing to occur a specific effect on the 

^ phenotype of cell. Such effect can be in the form of causing a change in the phenotype, e.g., 

differentiation to another cell phenotype, or can be in the form of maintaining the cell in a 
25 particular cell, e.g., preventing dedifferentation or promoting survival of a cell. 

As used herein the term "animaT refers to mammals, preferably mammals such as 
live stock or humans. Likewise, a "patient" or "subject" to be treated by the subject method 
can mean either a human or non-human animal. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
30 transporting another nucleic acid to which it has been linked. One type of preferred vector is 
$ an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors 

are those capable of autonomous replication and/expression of nucleic acids to which they are 
linked. Vectors capable of directing the expression of genes to which they are operatively 
linked are referred to herein as "expression vectors". In general, expression vectors of utility 
35 in recombinant DNA techniques are often in the form of "plasmids" which refer generally to 
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circular double stranded DNA loops which, in their vector form are not bound to the 
chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as 
the plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors which serve equivalent functions and which 
5 become known in the art subsequently hereto. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
which induce or control transcription of protein coding sequences with which they are 
operably linked. In preferred embodiments, transcription of one of the recombinant 
10 vertebrate hedgehog genes is under the control of a promoter sequence (or other 
>S transcriptional regulatory sequence) which controls the expression of the recombinant gene in 
a cell-type in which expression is intended. It will also be understood that the recombinant 
gene can be under the control of transcriptional regulatory sequences which are the same or 
„ which are different from those sequences which control transcription of the naturally- 

09 15 occurring forms of hedgehog proteins. 

m 

H-l As used herein, the term "tissue-specific promoter" means a DNA sequence that 

f serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 

5 linked to the promoter, and which effects expression of the selected DNA sequence in 

* specific cells of a tissue, such as cells of neural origin, e.g. neuronal cells. The term also 

□ 20 covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily 

;i in one tissue, but cause expression in other tissues as well. 

0 

fd As used herein, the term "target tissue" refers to connective tissue, cartilage, bone 

% tissue or limb tissue, which is either present in an animal, e.g., a mammal, e.g., a human 

or is present in in vitro culture, e.g, a cell culture. 

, 25 As used herein, a "transgenic animal" is any animal, preferably a non-human 

I mammal, bird or an amphibian, in which one or more of the cells of the animal contain 

heterologous nucleic acid introduced by way of human intervention, such as by transgenic 
techniques well known in the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 

30 manipulation, such as by microinjection or by infection with a recombinant virus. The term 
genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
rather is directed to the introduction of a recombinant DNA molecule. This molecule may be 
integrated within a chromosome, or it may be extrachromosomally replicating DNA. In the 
typical transgenic animals described herein, the transgene causes cells to express a 

35 recombinant form of one of the vertebrate hh proteins, e.g. either agonistic or antagonistic 
forms. However, transgenic animals in which the recombinant vertebrate hh gene is silent 
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are also contemplated, as for example, the FLP or CRE recombinase dependent constructs 
described below. The "non-human animals" of the invention include vertebrates such as 
rodents, non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc. Preferred 
non-human animals are selected from the rodent family including rat and mouse, most 
5 preferably mouse, though transgenic amphibians, such as members of the Xenopus genus, and 
transgenic chickens can also provide important tools for understanding and identifying agents 
which can affect, for example, embryogenesis and tissue formation. The term "chimeric 
animal" is used herein to refer to animals in which the recombinant gene is found, or in which 
the recombinant is expressed in some but not all cells of the animal. The term "tissue- 
1 0 specific chimeric animal" indicates that one of the recombinant vertebrate hh genes is present 
and/or expressed in some tissues but not others. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., 
one of the vertebrate hh polypeptides), which is partly or entirely heterologous, i.e., foreign, 
to the transgenic animal or cell into which it is introduced, or, is homologous to an 
15 endogenous gene of the transgenic animal or cell into which it is introduced, but which is 
designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the 
genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from 
that of the natural gene or its insertion results in a knockout). A transgene can include one or 
more transcriptional regulatory sequences and any other nucleic acid, such as introns, that 
20 may be necessary for optimal expression of a selected nucleic acid. 

As is well known, genes for a particular polypeptide may exist in single or multiple 
copies within the genome of an individual. Such duplicate genes may be identical or may 
have certain modifications, including nucleotide substitutions, additions or deletions, which 
all still code for polypeptides having substantially the same activity. The term "DNA 
25 sequence encoding a vertebrate hh polypeptide" may thus refer to one or more genes within a 
particular individual. Moreover, certain differences in nucleotide sequences may exist 
between individual organisms, which are called alleles. Such allelic differences may or may 
not result in differences in amino acid sequence of the encoded polypeptide yet still encode a 
protein with the same biological activity. 
30 "Homology" refers to sequence similarity between two peptides or between two 

nucleic acid molecules. Homology can be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
homologous at that position. A degree of homology between sequences is a function of the 
35 number of matching or homologous positions shared by the sequences. An "unrelated" or 
"non-homologous" sequence shares less than 40 percent identity, though preferably less than 
25 percent identity, with one of the vertebrate hh sequences of the present invention. 
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"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
herein. It is understood that such terms refer not only to the particular subject cell but to the 
progeny or potential progeny of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny 
may not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

As used herein, "recombinant cells" include any cells that have been modified by the 
introduction of heterologous DNA. Control cells include cells that are substantially identical 
to the recombinant cells, but do not express one or more of the proteins encoded by the 
heterologous DNA, e.g., do not include or express one or more of the exogenous 
phospholipase, regulatory protein, test polypeptide, or the reporter gene construct. 

As used herein, the terms "heterologous DNA" or "heterologous nucleic acid" is 
meant to include DNA that does not occur naturally as part of the genome in which it is 
present or DNA which is found in a location or locations in the genome that differs from 
that in which it occurs in nature. Heterologous DNA is not endogenous to the cell into 
which it is introduced, but has been obtained from another cell, i.e., is exogenous to the 
cell. Generally, although not necessarily, such DNA encodes RNA and proteins that are 
not normally produced by the cell in which it is expressed. Heterologous DNA may also 
be referred to as foreign DNA. Any DNA that one of skill in the art would recognize or 
consider as heterologous or foreign to the cell in which is expressed is herein 
encompassed by the term heterologous DNA. Examples of heterologous DNA include, 
but are not limited to, DNA that encodes a phospholipase, test polypeptides, regulatory 
proteins, reporter genes, transcriptional and translational regulatory sequences, or 
selectable or traceable marker proteins, such as a protein that confers drug resistance. 

The terms "recombinant protein", "heterologous protein" and "exogenous protein" are 
used interchangeably throughout the specification and refer to a polypeptide which is 
produced by recombinant DNA techniques, wherein generally, DNA encoding the 
polypeptide is inserted into a suitable expression vector which is in turn used to transform a 
host cell to produce the heterologous protein. That is, the polypeptide is expressed from a 
heterologous nucleic acid. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
encoding one of the subject vertebrate hh polypeptides with a second amino acid sequence 
defining a domain foreign to and not substantially homologous with any domain of one of the 
vertebrate hh proteins. A chimeric protein may present a foreign domain which is found 
(albeit in a different protein) in an organism which also expresses the first protein, or it may 
be an "interspecies", "intergenic", etc. fusion of protein structures expressed by different 
kinds of organisms. In general, a fusion protein can be represented by the general formula X- 
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hh-Y, wherein hh represents a portion of the protein which is derived from one of the 
vertebrate hh proteins, and X and Y are independently absent or represent amino acid 
sequences which are not related to one of the vertebrate hh sequences in an organism, 
including naturally occurring mutants. 

As used herein, a "reporter gene construct" is a nucleic acid that includes a 
"reporter gene" operatively linked to a transcriptional regulatory sequences. 
Transcription of the reporter gene is controlled by these sequences. The activity of at 
least one or more of these control sequences is directly or indirectly regulated by a signal 
transduction pathway involving a phospholipase, e.g., is directly or indirectly regulated 
by a second messenger produced by the phospholipase activity. The transcriptional 
regulatory sequences can include a promoter and other regulatory regions, such as 
enhancer sequences, that modulate the activity of the promoter, or regulatory sequences 
that modulate the activity or efficiency of the RNA polymerase that recognizes the 
promoter, or regulatory sequences that are recognized by effector molecules, including 
those that are specifically induced upon activation of a phospholipase. For example, 
modulation of the activity of the promoter may be effected by altering the RNA 
polymerase binding to the promoter region, or, alternatively, by interfering with 
initiation of transcription or elongation of the mRNA. Such sequences are herein 
collectively referred to as transcriptional regulatory elements or sequences. In addition, 
the construct may include sequences of nucleotides that alter the stability or rate of 
translation of the resulting mRNA in response to second messages, thereby altering the 
amount of reporter gene product. 

As used herein, the terms "transforming growth factor-beta" and "TGF-p" denote a 
family of structurally related paracrine polypeptides found ubiquitously in vertebrates, and 
prototypic of a large family of metazoan growth, differentiation, and morphogenesis factors 
(see, for review, Massaque et al. (1990) Ann Rev Cell Biol 6:597-641 ; and Sporn et al. (1992) 
J Cell Biol 119:1017-1021). Included in this family are the "bone morphogenetic proteins" 
or "BMPs", which refers to proteins isolated from bone, and fragments thereof and synthetic 
peptides which are capable of inducing bone deposition alone or when combined with 
appropriate cofactors. Preparation of BMPs, such as BMP-1, -2, -3, and -4, is described in, 
for example, PCT publication WO 88/00205. Wozney (1989) Growth Fact Res 1:267-280 
describes additional BMP proteins closely related to BMP-2, and which have been designated 
BMP-5, -6, and -7. PCT publications WO89/09787 and WO89/09788 describe a protein 
called "OP-1," now known to be BMP-7. Other BMPs are known in the art. 

The term "isolated" as also used herein with respect to nucleic acids, such as DNA or 
RNA, refers to molecules separated from other DNAs, or RNAs, respectively, that are present 
in the natural source of the macromolecule. For example, an isolated nucleic acid encoding 
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one of the subject vertebrate hh polypeptides preferably includes no more than 10 kilobases 
(kb) of nucleic acid sequence which naturally immediately flanks the vertebrate hh gene in 
genomic DNA, more preferably no more than 5kb of such naturally occurring flanking 
sequences, and most preferably less than 1.5kb of such naturally occurring flanking sequence. 
5 The term isolated as used herein also refers to a nucleic acid or peptide that is substantially 
free of cellular material, viral material, or culture medium when produced by recombinant 
DNA techniques, or chemical precursors or other chemicals when chemically synthesized. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments which are not 
naturally occurring as fragments and would not be found in the natural state. 

10 As used herein the term "approximately 19 kDa" with respect to N-terminal bioactive 

fragments of a hedgehog protein, refers to a polypeptide which can range in size from 16 kDa 
to 22 kDa, more preferably 18-20 kDa. In a preferred embodiment, "approximately 19 kDa" 
refers to a mature form of the peptide after the cleavage of the signal sequence and 

^ proteolysis to release an N-terminal portion of the mature protein. For instance, in the case of 

p 15 the Sonic hedgehog polypeptide, a fragment of approximately 19 kDa is generated when the 
mature polypeptide is cleaved at a proteolytic processing site which is located in the region 

£ between Ala- 169 and Gly-178 of SEQ ID No:40, e.g., a fragment from Cys-1 to Gly-174 of 

VI SEQIDNo:40. 

s 

Likewise, the term "approximately 27 kDa" with respect to C-terminal fragments of a 
q 20 hedgehog protein, refers to a polypeptide which can range in size from 24 kDa to 30 kDa, 
N more preferably 26-29 kDa. In a preferred embodiment, "approximately 27 kDa" refers to a 

jy mature form of the C-terminal polypeptide after proteolysis to release an N-terminal portion 

^0 of the mature protein. 

As described below, one aspect of the invention pertains to isolated nucleic acids 
25 comprising the nucleotide sequences encoding vertebrate hh homologues, and/or equivalents 
of such nucleic acids. The term nucleic acid as used herein is intended to include fragments 
as equivalents. The term equivalent is understood to include nucleotide sequences encoding 
functionally equivalent hedgehog polypeptides or functionally equivalent peptides having an 
activity of a vertebrate hedgehog protein such as described herein. Equivalent nucleotide 
30 sequences will include sequences that differ by one or more nucleotide substitutions, 
additions or deletions, such as allelic variants; and will, therefore, include sequences that 
differ from the nucleotide sequence of the vertebrate hedgehog cDNAs shown in SEQ ID 
Nos:l-7 due to the degeneracy of the genetic code. Equivalents will also include nucleotide 
sequences that hybridize under stringent conditions (i.e., equivalent to about 20-27°C below 
35 the melting temperature (T m ) of the DNA duplex formed in about 1M salt) to the nucleotide 
sequences represented in one or more of SEQ ID Nos:l-7. In one embodiment, equivalents 
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wiil further include nucleic acid sequences derived from and evolutionarily related to, a 
nucleotide sequences shown in any of SEQ ID Nos:l-7. 

Moreover, it will be generally appreciated that, under certain circumstances, it may be 
advantageous to provide homologs of one of the subject hedgehog polypeptides which 
function in a limited capacity as one of either a hedgehog agonist (mimetic) or a hedgehog 
antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protein. Thus, specific biological effects can be elicited by 
treatment with a homolog of limited function, and with fewer side effects relative to 
treatment with agonists or antagonists which are directed to all of the biological activities of 
naturally occurring forms of hedgehog proteins. 

Homologs of one of the subject hedgehog proteins can be generated by mutagenesis, 
such as by discrete point mutation(s), or by truncation. For instance, mutation can give rise 
to homologs which retain substantially the same, or merely a subset, of the biological activity 
of the hh polypeptide from which it was derived. Alternatively, antagonistic forms of the 
protein can be generated which are able to inhibit the function of the naturally occurring form 
of the protein, such as by competitively binding to an hh receptor. 

In general, polypeptides referred to herein as having an activity (e.g., are "bioactive") 
of a vertebrate hh protein are defined as polypeptides which include an amino acid sequence 
corresponding (e.g., identical or homologous) to all or a portion of the amino acid sequences 
of a vertebrate hh proteins shown in any of SEQ ID No:8, SEQ ID No:9, SEQ ID No: 10, 
SEQ ID Norll, SEQ ID No:12, SEQ ID No:13 or SEQ ID No:14 and which mimic or 
antagonize all or a portion of the biological/biochemical activities of a naturally occuring 
hedgehog protein. Examples of such biological activity include the ability to induce (or 
otherwise modulate) formation and differentiation of the head, limbs, lungs, central nervous 
system (CNS), digestive tract or other gut components, or mesodermal patterning of 
developing vertebrate embryos. As set out in USSN 08/356,060 and 08/176,427, the 
vertebrate hedgehog proteins, especially Shh, can constitute a general ventralizing activity. 
For instance, the subject polypeptides can be characterized by an ability to induce and/or 
maintain differentiation of neurons, e.g., motorneurons, cholinergic neurons, dopanergic 
neurons, serotenergic neurons, peptidergic neurons and the like. In preferred embodiments, 
the biological activity can comprise an ability to regulate neurogenesis, such as a motor 
neuron inducing activity, a neuronal differentiation inducing activity, or a neuronal survival 
promoting activity. Hedgehog proteins of the present invention can also have biological 
activities which include an ability to regulate organogensis, such as through the ability to 
influence limb patterning, by, for example, skeletogenic activity. The biological activity 
associated with the hedgehog proteins of the present invention can also include the ability to 
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induce stem cell or germ cell differentiation, including the ability to induce differentiation of 
chondrocytes or an involvement in spermatogenesis. 

Hedgehog proteins of the present invention can also be characterized in terms of 
biological activities which include; an ability to modulate proliferation, survival and/or 
differentiation of mesodermally-derived tissue, such as tissue derived from dorsal mesoderm; 
the ability to modulate proliferation, survival and/or differentiation of ectodermally-derived 
tissue, such as tissue derived from the neural tube, neural crest, or head mesenchyme; the 
ability to modulate proliferation, survival and/or differentiation of endodermally-derived 
tissue, such as tissue derived from the primitive gut. Moreover, as described in the Examples 
below, the subject hedgehog proteins have the ability to induce expression of secondary 
signaling molecules, such as members of the Transforming Growth Factor p (TGFp) family, 
including bone morphogenic proteins, e.g. BMP -2 and BMP-4, as well as members of the 
fibroblast growth factor (FGF) family, such as Fgf-4. Other biological activities of the 
subject hedgehog proteins are described herein or will be reasonably apparent to those skilled 
in the art. According to the present invention, a polypeptide has biological activity if it is a 
specific agonist or antagonist of a naturally-occurring form of a vertebrate hedgehog protein. 

Preferred nucleic acids encode a vertebrate hedgehog polypeptide comprising an 
amino acid sequence at least 60% homologous, more preferably 70% homologous and most 
preferably 80% homologous with an amino acid sequence selected from the group consisting 
of SEQ ID Nos:8-14. Nucleic acids which encode polypeptides at least about 90%, more 
preferably at least about 95%, and most preferably at least about 98-99% homology with an 
amino acid sequence represented in one of SEQ ID Nos:8-14 are also within the scope of the 
invention. In one embodiment, the nucleic acid is a cDNA encoding a peptide having at least 
one activity of the subject vertebrate hh polypeptide. Preferably, the nucleic acid includes all 
or a portion of the nucleotide sequence corresponding to the coding region of SEQ ID Nos:l- 
7. 

Preferred nucleic acids encode a- bioactive fragment of a vertebrate hedgehog 
polypeptide comprising an amino acid sequence at least 60% homologous, more preferably 
70% homologous and most preferably 80% homologous with an amino acid sequence 
selected from the group consisting of SEQ ID Nos:8-14. Nucleic acids which encode 
polypeptides at least about 90%, more preferably at least about 95%, and most preferably at 
least about 98-99% homology, or identical, with an amino acid sequence represented in one 
of SEQ ID Nos:8-14 are also within the scope of the invention. 

With respect to bioctive fragments of sonic clones, a preferred nucleic acid encodes a 
polypeptide including a hedgehog portion having molecular weight of approximately 19 kDa 
and which polyptide can modulate, e.g., mimic or antagonize, a hedgehog biological activity. 
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Preferably, the polypeptide encoded by the nucleic acid comprises an amino acid sequence 
identical or homologous to an amino acid sequence designated in one of SEQ ID No:8, SEQ 
ID No:9, SEQ ID No:10, SEQ ID No:l 1, SEQ ID No:12, SEQ ID No:13, or SEQ ID No:14. 
More preferably, the polypeptide comprises an amino acid sequence designated in SEQ ID 
No:40. 

A preferred nucleic acid encodes a hedgehog polypeptide comprising an amino acid 
sequence represented by the formula A-B wherein, A represents all or the portion of the 
amino acid sequence designated by residues 1-168 of SEQ ID No:40; and B represents at 
least one amino acid residue of the amino acid sequence designated by residues 169-221 of 
SEQ ID No:40; wherein A and B together represent a contiguous polypeptide sequence 
designated by SEQ ID No:40. Preferably, B can represent at least five, ten or twenty amino 
acid residues of the amino acid sequence designated by residues 169-221 of SEQ ID No:40. 

To further illustrate, another preferred nucleic acid encodes a polypeptide comprising 
an amino acid sequence represented by the formula A-B, wherein A represents all or the 
portion of the amino acid sequence designated by residues 24-193 of SEQ ID No: 13; and B 
represents at least one amino acid residue of the amino acid sequence designated by residues 
194-250 of SEQ ID No: 13; wherein A and B together represent a contiguous polypeptide 
sequence designated in SEQ ID No: 13, and the polypeptide modulates, e.g., agonizes or 
antagonizes, the biological activity of a hedgehog protein. 

Yet another preferred nucleic acid encodes a polypeptide comprising an amino acid 
sequence represented by the formula A-B, wherein A represents all or the portion, e.g., 25, 
50, 75 or 100 residues, of the amino acid sequence designated by residues 25-193, or 
analogous residues thereof, of a vertebrate hedgehog polypeptide identical or homologous to 
SEQ ID No:l 1; and B represents at least one amino acid residue of the amino acid sequence 
designated by residues 194-250, or analogous residues thereof, of a vertebrate hedgehog 
polypeptide identical or homologous to SEQ ID No: 1 1; wherein A and B together represent a 
contiguous polypeptide sequence designated in SEQ ID No: 11. 

Another preferred nucleic acid encodes a polypeptide comprising an amino acid 
sequence represented by the formula A-B, wherein A represents all or the portion, e.g., 25, 
50, 75 or 100 residues, of the amino acid sequence designated by residues 23-193 of SEQ ID 
No:9; and B represents at least one amino acid residue of the amino acid sequence designated 
by residues 194-250 of SEQ ID No:9; wherein A and B together represent a contiguous 
polypeptide sequence designated in SEQ ID No:9, and the polypeptide modulates, e.g., 
agonizes or antagonizes, the biological activity of a hedgehog protein. 

Another preferred nucleic acid encodes a polypeptide comprising an amino acid 
sequence represented by the formula A-B, wherein A represents all or the portion, e.g., 25, 
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50, 75 or 100 residues, of the amino acid sequence designated by residues 28-197 of SEQ ED 
No: 10; and B represents at least one amino acid residue of the amino acid sequence 
designated by residues 198-250 of SEQ ID No: 10; wherein A and B together represent a 
contiguous polypeptide sequence designated in SEQ ID No: 10, and the polypeptide 
modulates, e.g., agonizes or antagonizes, the biological activity of a hedgehog protein. 

Yet another preferred nucleic acid encodes a polypeptide comprising an amino acid 
sequence represented by the formula A-B, wherein A represents all or the portion, e.g., 25, 50 
or 75 residues, of the amino acid sequence designated by residues 1-98, or analogous residues 
thereof, of a vertebrate hedgehog polypeptide identical or homologous to SEQ ID No: 14; and 
B represents at least one amino acid residue of the amino acid sequence designated by 
residues 99-150, or analogous residues thereof, of a vertebrate hedgehog polypeptide 
identical or homologous to SEQ ID No:14; wherein A and B together represent a contiguous 
polypeptide sequence designated in SEQ ID No: 14. 

Another aspect of the invention provides a nucleic acid which hybridizes under high 
or low stringency conditions to a nucleic acid represented by one of SEQ ID Nos:l-7. 
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x 
sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 
50°C, are known to those skilled in the art or can be found in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 63.1-6.3.6. For example, the salt concentration in 
the wash step can be selected from a low stringency of about 2.0 x SSC at 50°C to a high 
stringency of about 0.2 x SSC at 50°C. In addition, the temperature in the wash step can be 
increased from low stringency conditions at room temperature, about 22°C, to high 
stringency conditions at about 65 °C. 

Nucleic acids, having a sequence that differs from the nucleotide sequences shown in 
one of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID No:4, SEQ ID No:5, SEQ ID 
No:6 or SEQ ID No:7 due to degeneracy in the genetic code are also within the scope of the 
invention. Such nucleic acids encode functionally equivalent peptides (i.«., a peptide having 
a biological activity of a vertebrate hh polypeptide) but differ in sequence from the sequence 
shown in the sequence listing due to degeneracy in the genetic code. For example, a number 
of amino acids are designated by more than one triplet. Codons that specify the same amino 
acid, or synonyms (for example, CAU and CAC each encode histidine) may result in "silent" 
mutations which do not affect the amino acid sequence of a vertebrate hh polypeptide. 
However, it is expected that DNA sequence polymorphisms that do lead to changes in the 
amino acid sequences of the subject hh polypeptides will exist among vertebrates. One 
skilled in the art will appreciate that these variations in one or more nucleotides (up to about 
3-5% of the nucleotides) of the nucleic acids encoding polypeptides having an activity of a 
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vertebrate hh polypeptide may exist among individuals of a given species due to natural 
allelic variation. 

As used herein, a hedgehog gene fragment refers to a nucleic acid having fewer 
nucleotides than the nucleotide sequence encoding the entire mature form of a vertebrate hh 
5 protein yet which (preferably) encodes a polypeptide which retains some biological activity 
of the full length protein. 

As indicated by the examples set out below, hedgehog protein-encoding nucleic acids 
can be obtained from mRNA present in any of a number of eukaryotic cells. It should also be 
possible to obtain nucleic acids encoding vertebrate hh polypeptides of the present invention 

10 from genomic DNA obtained from both adults and embryos. For example, a gene encoding a 
hh protein can be cloned from either a cDNA or a genomic library in accordance with 
protocols described herein, as well as those generally known to persons skilled in the art. A 
cDNA encoding a hedgehog protein can be obtained by isolating total mRNA from a cell, 
e.g. a mammalian cell, e.g. a human cell, including embryonic cells. Double stranded 

15 cDNAs can then be prepared from the total mRNA, and subsequently inserted into a suitable 
plasmid or bacteriophage vector using any one of a number of known techniques. The gene 
encoding a vertebrate hh protein can also be cloned using established polymerase chain 
reaction techniques in accordance with the nucleotide sequence information provided by the 
invention. The nucleic acid of the invention can be DNA or RNA. A preferred nucleic acid 

20 is a cDNA represented by a sequence selected from the group consisting of SEQ ID Nos: 1 -7. 

Another aspect of the invention relates to the use of the isolated nucleic acid in 
"antisense" therapy. As used herein, "antisense" therapy refers to administration or in situ 
generation of oligonucleotide probes or their derivatives which specifically hybridizes (e.g. 
binds) under cellular conditions, with the cellular mRNA and/or genomic DNA encoding one 

25 or more of the subject hedgehog proteins so as to inhibit expression of that protein, e.g. by 
inhibiting transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through specific 
interactions in the major groove of the double helix. In general, "antisense" therapy refers to 
the range of techniques generally employed in the art, and includes any therapy which relies 

30 on specific binding to oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of the cellular mRNA which encodes a vertebrate 
hh protein. Alternatively, the antisense construct is an oligonucleotide probe which is 
35 generated ex vivo and which, when introduced into the cell causes inhibition of expression by 
hybridizing with the mRNA and/or genomic sequences of a vertebrate hh gene. Such 
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oligonucleotide probes are preferably modified oligonucleotide which are resistant to 
endogenous nucleases, e.g. exonucleases and/or endonucleases, and is therefore stable in 
vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are 
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. 
5 Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to 
constructing oligomers useful in antisense therapy have been reviewed, for example, by Van 
der Krol et al. (1988) Bioteckniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 
2668. 

Accordingly, the modified oligomers of the invention are useful in therapeutic, 
10 diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized in a 
manner appropriate for antisense therapy in general. For such therapy, the oligomers of the 
invention can be formulated for a variety of loads of administration, including systemic and 
topical or localized administration. Techniques and formulations generally may be found in 
p Remmington's Pharmaceutical Sciences. Meade Publishing Co., Easton, PA. For systemic 

0 15 administration, injection is preferred, including intramuscular, intravenous, intraperitoneal, 
and subcutaneous for injection, the oligomers of the invention can be formulated in liquid 
j£ solutions, preferably in physiologically compatible buffers such as Hank's solution or 

W Ringer's solution. In addition, the oligomers may be formulated in solid form and 

C3 

yj redissolved or suspended immediately prior to use. Lyophilized forms are also included. 

5 

Q 20 Systemic administration can also be by transmucosal or transdermal means, or the 

N 1 compounds can be administered orally. For transmucosal or transdermal administration, 

penetrants appropriate to the barrier to be permeated are used in the formulation. Such 
Jl penetrants are generally known in the art, and include, for example, for transmucosal 

fl^ administration bile salts and fusidic acid derivatives. In addition, detergents may be used to 

25 facilitate permeation. Transmucosal administration may be through nasal sprays or using 
suppositories. For oral administration, the oligomers are formulated into conventional oral 
administration forms such as capsules, tablets, and tonics. For topical administration, the 
oligomers of the invention are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

30 In addition to use in therapy, the oligomers of the invention may be used as diagnostic 

reagents to detect the presence or absence of the target DNA or RNA sequences to which 
they specifically bind. Such diagnostic tests are described in further detail below. 

Likewise, the antisense constructs of the present invention, by antagonizing the 
normal biological activity of one of the hedgehog proteins, can be used in the manipulation 
35 of tissue, e.g. tissue differentiation, both in vivo and in ex vivo tissue cultures. 
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Also, the anti-sense techniques (e.g. microinjection of antisense molecules, or 
transfection with plasmids whose transcripts are anti-sense with regard to an hh mRNA or 
gene sequence) can be used to investigate role of hh in developmental events, as well as the 
normal cellular function of hh in adult tissue. Such techniques can be utilized in cell culture, 
but can also be used in the creation of transgenic animals. 

This invention also provides expression vectors containing a nucleic acid encoding a 
vertebrate hh polypeptide, operably linked to at least one transcriptional regulatory sequence. 
Operably linked is intended to mean that the nucleotide sequence is linked to a regulatory 
sequence in a manner which allows expression of the nucleotide sequence. Regulatory 
sequences are art-recognized and are selected to direct expression of the subject vertebrate hh 
proteins. Accordingly, the term transcriptional regulatory sequence includes promoters, 
enhancers and other expression control elements. Such regulatory sequences are described in 
Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA (1990). For instance, any of a wide variety of expression control sequences, 
sequences that control the expression of a DNA sequence when operatively linked to it, may 
be used in these vectors to express DNA sequences encoding vertebrate hh polypeptides of 
this invention. Such useful expression control sequences, include, for example, a viral LTR, 
such as the LTR of the Moloney murine leukemia virus, the early and late promoters of 
SV40, adenovirus or cytomegalovirus immediate early promoter, the lac system, the trp 
system, the TAC or TRC system, T7 promoter whose expression is directed by 17 RNA 
polymerase, the major operator and promoter regions of phage X , the control regions for fd 
coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the 
promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast a-mating factors, the 
polyhedron promoter of the baculovirus system and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells or their viruses, and various 
combinations thereof. It should be understood that the design of the expression vector may 
depend on such factors as the choice of the host cell to be transformed and/or the type of 
protein desired to be expressed. Moreover, the vector's copy number, the ability to control 
that copy number and the expression of any other proteins encoded by the vector, such as 
antibiotic markers, should also be considered. In one embodiment, the expression vector 
includes a recombinant gene encoding a peptide having an agonistic activity of a subject 
hedgehog polypeptide, or alternatively, encoding a peptide which is an antagonistic form of 
the hh protein. Such expression vectors can be used to transfect cells and thereby produce 
polypeptides, including fusion proteins, encoded by nucleic acids as described herein. 

Moreover, the gene constructs of the present invention can also be used as a part of a 
gene therapy protocol to deliver nucleic acids encoding either an agonistic or antagonistic 
form of one of the subject vertebrate hedgehog proteins. Thus, another aspect of the 
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invention features expression vectors for in vivo or in vitro transfection and expression of a 
vertebrate hh polypeptide in particular cell types so as to reconstitute the function of, or 
alternatively, abrogate the function of hedgehog-induced signaling in a tissue in which the 
naturally-occurring form of the protein is misexpressed; or to deliver a form of the protein 
which alters differentiation of tissue, or which inhibits neoplastic transformation. 

Expression constructs of the subject vertebrate hh polypeptide, and mutants thereof, 
may be administered in any biologically effective carrier, e.g. any formulation or 
composition capable of effectively delivering the recombinant gene to cells in vivo. 
Approaches include insertion of the subject gene in viral vectors including recombinant 
retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus- 1, or recombinant 
bacterial or eukaryotic plasmids. Viral vectors transfect cells directly; plasmid DNA can be 
delivered with the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g. 
antibody conjugated), polylysine conjugates, gramacidin S, artificial viral envelopes or other 
such intracellular carriers, as well as direct injection of the gene construct or CaP0 4 
precipitation carried out in vivo. It will be appreciated that because transduction of 
appropriate target cells represents the' critical first step in gene therapy, choice of the 
particular gene delivery system will depend on such factors as the phenotype of the intended 
target and the route of administration, e.g. locally or systemically. Furthermore, it will be 
recognized that the particular gene construct provided for in vivo transduction of hedgehog 
expression are also useful for in vitro transduction of cells, such as for use in the ex vivo 
tissue culture systems described below. 

A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a 
viral vector containing nucleic acid, e.g. a cDNA, encoding the particular form of the 
hedgehog polypeptide desired. Infection of cells with a viral vector has the advantage that a 
large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules 
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed 
efficiently in cells which have taken up viral vector nucleic acid* 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombinant gene delivery system of choice for the transfer of exogenous genes in vivo, 
particularly into humans. These vectors provide efficient delivery of genes into cells, and the 
transferred nucleic acids are stably integrated into the chromosomal DNA of the host. A 
major prerequisite for the use of retroviruses is to ensure the safety of their use, particularly 
with regard to the possibility of the spread of wild-type virus in the cell population. The 
development of specialized cell lines (termed "packaging cells") which produce only 
replication-defective retroviruses has increased the utility of retroviruses for gene therapy, 
and defective retroviruses are well characterized for use in gene transfer for gene therapy 
purposes (for a review see Miller, A.D. (1990) Blood 76:271). Thus, recombinant retrovirus 
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can be constructed in which part of the retroviral coding sequence (gag, pol, env) has been 
replaced by nucleic acid encoding one of the subject proteins rendering the retrovirus 
replication defective. The replication defective retrovirus is then packaged into virions which 
can be used to infect a target cell through the use of a helper virus by standard techniques. 
5 Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo 
with such viruses can be found in Current Protocols in Molecular Biology. Ausubel, F.M. et 
al. (eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard 
laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM 
which are well known to those skilled in the art. Examples of suitable packaging virus lines 
10 for preparing both ecotropic and amphotropic retroviral systems include \yCrip, \}/Cre, \\f2 
and \|/Am. Retroviruses have been used to introduce a variety of genes into many different 
cell types, including neuronal cells, in vitro and/or in vivo (see for example Eglitis, et al. 
(1985) Science 230:1395-1398; Danos and Mulligan (1988) Proa Natl Acad. Scl USA 
85:6460-6464; Wilson et al. (1988) Proc. Natl Acad Sci. USA 85:3014-3018; Armentano et 
P 15 al. (1990) Proc. Natl Acad ScL USA 87:6141-6145; Huber et al. (1991) Proa Natl Acad 
jp Sci. USA 88:8039-8043; Ferry et al. (1991) Proc, Natl Acad Sci. USA 88:8377-8381; 

*J Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proa Natl 

* Acad Scl USA 89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et al. 

p (1992) Proa Natl Acad Scl USA 89:10892-10895; Hwu et al. (1993) J. Immunol 150:4104- 

*0 20 4115; U.S. Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 
q 89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and PCT 

N Application WO 92/07573). 

|jj Furthermore, it has been shown that it is possible to limit the infection spectrum of 

|P retroviruses and consequently of retroviral-based vectors, by modifying the viral packaging 

25 proteins on the surface of the viral particle (see, for example PCT publications W093/25234 
and WO94/06920). For instance, strategies for the modification of the infection spectrum of 
retroviral vectors include: coupling antibodies specific for cell surface antigens to the viral 
env protein (Roux et *L (1989) PNAS 86:9079-9083; Man et al. (1992) J. Gen Virol 
73:3251-3255; and Goud et al. (1983) Virology 163:251-254); or coupling cell surface 

30 receptor ligands to the viral env proteins (Neda et al. (1991) J Biol Chem 266:14143-14146). 
Coupling can be in the form of the chemical cross-linking with a protein or other variety (e.g. 
lactose to convert the env protein to an asialoglycoprotein), as well as by generating fusion 
proteins (e.g. single-chain antibody/e«v fusion proteins). This technique, while useful to 
limit or otherwise direct the infection to certain tissue types, can also be used to convert an 

35 ecotropic vector in to an amphotropic vector. 
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Moreover, use of retroviral gene delivery can be further enhanced by the use of tissue- 
or cell-specific transcriptional regulatory sequences which control expression of the hh gene 
of the retroviral vector. 

Another viral gene delivery system useful in the present invention utilizes adenovirus- 
5 derived vectors. The genome of an adenovirus can be manipulated such that it encodes and 
expresses a gene product of interest but is inactivated in terms of its ability to replicate in a 
normal lytic viral life cycle. See for example Berkner et aL (1988) BioTechnigues 6:616; 
Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155. 
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other 

10 strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. 
Recombinant adenoviruses can be advantageous in certain circumstances in that they can be 
used to infect a wide variety of cell types, including airway epithelium (Rosenfeld et al. 
(1992) cited supra), endothelial cells (Lemarchand et al. (1992) Proc. Natl Acad ScL USA 
89:6482-6486), hepatocytes (Herz and Gerard (1993) Proc. Natl Acad Set USA 90:2812- 

15 2816) and muscle cells (Quantin et al. (1992) Proc. Natl Acad Set USA 89:2581-2584). 
Furthermore, the virus particle is relatively stable and amenable to purification and 
concentration, and as above, can be modified so as to affect the spectrum of infectivity. 
Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not 
integrated into the genome of a host cell but remains episomal, thereby avoiding potential 

20 problems that can occur as a result of insertional mutagenesis in situations where introduced 
DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the 
carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) 
relative to other gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and Graham 
(1986) J. Virol 57:267). Most replication-defective adenoviral vectors currently in use and 

25 therefore favored by the present invention are deleted for all or parts of the viral El and E3 
genes but retain as much as 80% of the adenoviral genetic material (see, e.g., Jones et al. 
(1979) Cell 16:683; Berkner et al., supra; and Graham et al. in Methods in Molecular 
Biology, EJ. Murray, Ed. (Humana, Clifton, NJ, 1991) vol. 7. pp. 109-127). Expression of 
the inserted hedgehog gene can be under control of, for example, the El A promoter, the 

30 major late promoter (MLP) and associated leader sequences, the E3 promoter, or 
exogenously added promoter sequences. 

Yet another viral vector system useful for delivery of one of the subject vertebrate hh 
genes is the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring 
defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper 
35 virus for efficient replication and a productive life cycle. (For a review see Muzyczka et al. 
Cwrr. Topics in Micro, and Immunol (1992) 158:97-129). It is also one of the few viruses 
that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable ■ 
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integration (see for example Flotte et al. (1992) Am. J. Respir. Cell. Mol Biol 7:349-356; 
Samulski et al. (1989) J, Virol 63:3822-3828; and McLaughlin et al. (1989) J. Virol 
62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged and 
can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as 
5 that described in Tratschin et al. (1985) Mol Cell Biol 5:3251-3260 can be used to introduce 
DNA into cells. A variety of nucleic acids have been introduced into different cell types 
using AAV vectors (see for example Hermonat et al. (1984) Proc. Natl Acad. Set USA 
81:6466-6470; Tratschin et al. (1985) Mol Cell Biol 4:2072-2081; Wondisford et al. (1988) 
Mol Endocrinol 2:32-39; Tratschin et al. (1984) J. Virol 51:611-619; and Flotte et al. 
10 (1993) J. Biol Chem. 268:3781-3790). 

In addition to viral transfer methods, such as those illustrated above, non-viral 
methods can also be employed to cause expression of a subject hedgehog polypeptide in the 
tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used 
by mammalian cells for the uptake and intracellular transport of macromolecules. In 
15 preferred embodiments, non- viral gene delivery systems of the present invention rely on 
endocytic pathways for the uptake of the subject hh polypeptide gene by the targeted cell. 
Exemplary gene delivery systems of this type include liposomal derived systems, poly-lysine 
conjugates, and artificial viral envelopes. 

In clinical settings, the gene delivery systems for the therapeutic hedgehog gene can 
20 be introduced into a patient by any of a number of methods, each of which is familiar in the 
art. For instance, a pharmaceutical preparation of the gene delivery system can be introduced 
systemically, e.g. by intravenous injection, and specific transduction of the protein in the 
target cells occurs predominantly from specificity of transfection provided by the gene 
delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory 
25 sequences controlling expression of the receptor gene, or a combination thereof. In other 
embodiments, initial delivery of the recombinant gene is more limited with introduction into 
the animal being quite localized. For example, the gene delivery vehicle can be introduced 
by catheter (see U.S. Patent 5,328,470) or by stereotactic injection (e.g. Chen et al. (1994) 
PNAS 91 : 3054-3057). A vertebrate hh gene, such as any one of the clones represented in the 
30 group consisting of SEQ ID NO: 1-7, can be delivered in a gene therapy construct by 
electroporation using techniques described, for example, by Dev et al. ((1994) Cancer Treat 
Rev 20:105-115). 

The pharmaceutical preparation of the gene therapy construct can consist essentially 
of the gene delivery system in an acceptable diluent, or can comprise a slow release matrix in 
35 which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery system can be produced intact from recombinant cells, e.g. retroviral vectors, the 
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pharmaceutical preparation can comprise one or more cells which produce the gene delivery 
system. 

Another aspect of the present invention concerns recombinant forms of the hedgehog 
proteins. Recombinant polypeptides preferred by the present invention, in addition to native 
5 hedgehog proteins, are at least 60% homologous, more preferably 70% homologous and most 
preferably 80% homologous with an amino acid sequence represented by any of SEQ ID 
Nos:8-14. Polypeptides which possess an activity of a hedgehog protein (i.e. either agonistic 
or antagonistic), and which are at least 90%, more preferably at least 95%, and most 
preferably at least about 98-99% homologous with a sequence selected from the group 
10 consisting of SEQ ID Nos:8-14 are also within the scope of the inventioa 

The term "recombinant protein" refers to a polypeptide of the present invention which 
is produced by recombinant DNA techniques, wherein generally, DNA encoding a vertebrate 
hh polypeptide is inserted into a suitable expression vector which is in turn used to transform 



O a host cell to produce the heterologous protein. Moreover, the phrase "derived from", with 

p 15 respect to a recombinant hedgehog gene, is meant to include within the meaning of 
"recombinant protein" those proteins having an amino acid sequence of a native hedgehog 

jjj protein, or an amino acid sequence similar thereto which is generated by mutations including 

O substitutions and deletions (including truncation) of a naturally occurring form of the protein. 

iS 

« The present invention further pertains to recombinant forms of one of the subject 

p 20 hedgehog polypeptides which are encoded by genes derived from a vertebrate organism, 

-4 

Q particularly a mammal (e.g. a human), and which have amino acid sequences evolutionarily 



related to the hedgehog proteins represented in SEQ ID Nos:8-14. Such recombinant hh 
polypeptides preferably are capable of functioning in one of either role of an agonist or 
antagonist of at least one biological activity of a wild-type ("authentic") hedgehog protein of 

25 the appended sequence listing. The term "evolutionarily related to", with respect to amino 
acid sequences of vertebrate hedgehog proteins, refers to both polypeptides having amino 
acid sequences which have arisen naturally, and also to mutational variants of vertebrate hh 
polypeptides which are derived, for example, by combinatorial mutagenesis. Such 
evolutionarily derived hedgehog proteins polypeptides preferred by the present invention are 

30 at least 60% homologous, more preferably 70% homologous and most preferably 80% 
homologous with the amino acid sequence selected from the group consisting of SEQ ID 
Nos:8-14. Polypeptides having at least about 90%, more preferably at least about 95%, and 
most preferably at least about 98-99% homology with a sequence selected from the group 
consisting of SEQ ID Nos:8-14 are also within the scope of the invention. 

35 The present invention further pertains to methods of producing the subject hedgehog 

polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
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expression of a nucleotide sequence encoding the subject polypeptides can be cultured under 
appropriate conditions to allow expression of the peptide to occur. The polypeptide 
hedgehog may be secreted and isolated from a mixture of cells and medium containing the 
recombinant vertebrate hh polypeptide. Alternatively, the peptide may be retained 
cytoplasmically by removing the signal peptide sequence from the recombinant hh gene and 
the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media 
and other byproducts. Suitable media for cell culture are well known in the art. The 
recombinant hh polypeptide can be isolated from cell culture medium, host cells, or both 
using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaffinity purification with antibodies specific for such peptide. In a preferred 
embodiment, the recombinant hh polypeptide is a fusion protein containing a domain which 
facilitates its purification, such as an hh/GST fusion protein. 

This invention also pertains to a host cell transfected to express a recombinant form of 
the subject hedgehog polypeptides. The host cell may be any prokaryotic or eukaryotic cell. 
Thus, a nucleotide sequence derived from the cloning of vertebrate hedgehog proteins, 
encoding all or a selected portion of the full-length protein, can be used to produce a 
recombinant form of a vertebrate hh polypeptide via microbial or eukaryotic cellular 
processes. Ligating the polynucleotide sequence into a gene construct, such as an expression 
vector, and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or 
mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing other 
well-known proteins, e.g. insulin, interferons, human growth hormone, IL-1, IL-2, and the 
like. Similar procedures, or modifications thereof, can be employed to prepare recombinant 
hedgehog polypeptides by microbial means or tissue-culture technology in accord with the 
subject invention. 

The recombinant hedgehog genes can be produced by ligating nucleic acid encoding 
an hh protein, or a portion thereof, into a vector suitable for expression in either prokaryotic 
cells, eukaryotic cells, or both. Expression vectors for production of recombinant forms of 
the subject hh polypeptides include plasmids and other vectors. For instance, suitable vectors 
for the expression of a hedgehog polypeptide include plasmids of the types: pBR322-derived 
plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids and 
pUC-derived plasmids for expression in prokaryotic cells, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in yeast. For 
instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are cloning and expression 
vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, for example, 
Broach et al (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye 
Academic Press, p. 83, incorporated by reference herein). These vectors can replicate in E. 
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coli due the presence of the pBR322 ori, and in S. cerevisiae due to the replication 
determinant of the yeast 2 micron plasmid. In addition* drug resistance markers such as 
ampicillin can be used. In an illustrative embodiment, an hh polypeptide is produced 
recombinantly utilizing an expression vector generated by sub-cloning the coding sequence of 
5 one of the hedgehog genes represented in SEQ ID Nos: 1 -7. 

The preferred mammalian expression vectors contain both prokaryotic sequences, to 
facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription 
units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, 
pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived 
10 vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic 
cells. Some of these vectors are modified with sequences from bacterial plasmids, such as 
pBR322, to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papillomavirus 
p (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient 

|3 15 expression of proteins in eukaryotic cells. The various methods employed in the preparation 
of the plasmids and transformation of host organisms are well known in the art. For other 
?z suitable expression systems for both prokaryotic and eukaryotic cells, as well as general 

W recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by 

Q 

Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 

* 20 and 17. 
P 

^ In some instances, it may be desirable to express the recombinant hedgehog 

G 

pi polypeptide by the use of a baculovirus expression system. Examples of such baculovirus 

y3 expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and pVL941), 

^ pAcUW-derived vectors (such as pAcUWl), and pBlueBac-derived vectors (such as the B-gal 

25 containing pBlueBac III). 

When it is desirable to express only a portion of an hh protein, such as a form lacking 
a portion of the N-terminus, i.e. a truncation mutant which lacks the signal peptide, it may be 
necessary to add a start codon (ATG) to the oligonucleotide fragment containing the desired 
sequence to be expressed. It is well known in the art that a methionine at the N-terminal 

30 position can be enzymatically cleaved by the use of the enzyme methionine aminopeptidase 
(MAP). MAP has been cloned from E, coli (Ben-Bassat et al. (1987) J Bacteriol 169:751- 
757) and Salmonella typhimurium and its in vitro activity has been demonstrated on 
recombinant proteins (Miller et al. (1987) PNAS 54:2718-1722). Therefore, removal of an N- 
terminal methionine, if desired, can be achieved either in vivo by expressing hedgehog- 

35 derived polypeptides in a host which produces MAP (e.g., E. coli or CM89 or S. cerevisiae\ 
or in vitro by use of purified MAP (e.g., procedure of Miller et al., supra). 
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Alternatively, the coding sequences for the polypeptide can be incorporated as a part 
of a fusion gene including a nucleotide sequence encoding a different polypeptide. This type 
of expression system can be useful under conditions where it is desirable to produce an 
immunogenic fragment of a hedgehog protein. For example, the VP6 capsid protein of 
5 rotavirus can be used as an immunologic carrier protein for portions of the hh polypeptide, 
either in the monomelic form or in the form of a viral particle. The nucleic acid sequences 
* corresponding to the portion of a subject hedgehog protein to which antibodies are to be 

raised can be incorporated into a fusion gene construct which includes coding sequences for a 
late vaccinia virus structural protein to produce a set of recombinant viruses expressing 
10 fusion proteins comprising hh epitopes as part of the virion. It has been demonstrated with 
the use of immunogenic fusion proteins utilizing the Hepatitis B surface antigen fusion 
proteins that recombinant Hepatitis B virions can be utilized in this role as well. Similarly, 
chimeric constructs coding for fusion proteins containing a portion of an hh protein and the 
poliovirus capsid protein can be created to enhance immunogenicity of the set of polypeptide 
O 15 antigens (see, for example, EP Publication No: 0259149; and Evans et aL (1989) Nature 
2 339:385; Huang et al. (1988) J. Virol 62:3855; and Schlienger et al. (1992) J. Virol 66:2). 



-a 
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The Multiple Antigen Peptide system for peptide-based immunization can also be 
4f! utilized to generate an immunogen, wherein a desired portion of an hh polypeptide is obtained 

Jij directly from organo-chemical synthesis of the peptide onto an oligomeric branching lysine 

* 20 core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli et al. (1992) 
P J. Immunol 148:914). Antigenic determinants of hh proteins can also be expressed and 



presented by bacterial cells. 



^0 In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 

ff* appreciated that fusion proteins can also facilitate the expression of proteins, and accordingly, 

25 can be used in the expression of the vertebrate hh polypeptides of the present invention. For 
example, hedgehog polypeptides can be generated as glutathione-S-transferase (GST-fusion) 
proteins. Such GST-fusion proteins can enable easy purification of the hedgehog 
polypeptide, as for example by the use of glutathione-derivatized matrices (see, for example, 
Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 

30 1991)). In another embodiment, a fusion gene coding for a purification leader sequence, such 
as a poly-(His)/enterokinase cleavage site sequence, can be used to replace the signal 
sequence which naturally occurs at the N-teraiinus of the hh protein (e.g.of the pro-form, in 
order to pennit purification of the poly(His)-/iA protein by affinity chromatography using a 
Ni 2+ metal resin. The purification leader sequence can then be subsequently removed by 

35 treatment with enterokinase (e.g., see Hochuli et al. (1987) J. Chromatography 41 1 :177; and 
Janknecht et al. PNAS 88:8972). 
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Techniques for making fusion genes are known to those skilled in the art. Essentially, 
the joining of various DNA fragments coding for different polypeptide sequences is 
performed in accordance with conventional techniques, employing blunt-ended or stagger- 
ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
5 filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized 
by conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene fragments which can subsequently 
10 be annealed to generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology y eds. Ausubel et al. John Wiley & Sons: 1992). 

^ Hedgehog polypeptides may also be chemically modified to create hh derivatives by 

forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
^ groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of hedgehog 

j\ 15 proteins can be prepared by linking the chemical moieties to functional groups on amino acid 
*\ sidechains of the protein or at the N-terminus or at the C-terminus of the polypeptide. 



a 



For instance, hedgehog proteins can be generated to include a moiety, other than 
sequence naturally associated with the protein, that binds a component of the extracellular 
^ matrix and enhances localization of the analog to cell surfaces. For example, sequences 

q 20 derived from the fibronectin "type-Ill repeat", such as a tetrapeptide sequence R-G-D-S 
H (Pierschbacher et al. (1984) Nature 309:30-3; and Kornblihtt et al. (1985) EMBO 4:1755-9) 

J^J can be added to the hh polypeptide to support attachment of the chimeric molecule to a cell 

through binding ECM components (Ruoslahti et al. (1987) Science 238:491-497; 
Pierschbacheret al. (1987) J. Biol Chem. 262:17294-8.; Hynes (1987) Cell 48:549-54; and 
25 Hynes (1992) Cell 69:1 1-25). 

The present invention also makes available isolated hedgehog polypeptides which are 
isolated from, or otherwise substantially free of other cellular and extracellular proteins, 
especially morphogenic proteins or other extracellular or cell surface associated proteins 
which may normally be associated with the hedgehog polypeptide. The term "substantially 
30 free of other cellular or extracellular proteins" (also referred to herein as "contaminating 
proteins") or "substantially pure or purified preparations" are defined as encompassing 
preparations of hh polypeptides having less than 20% (by dry weight) contaminating protein, 
and preferably having less than 5% contaminating protein. Functional forms of the subject 
polypeptides can be prepared, for the first time, as purified preparations by using a cloned 
35 gene as described herein. By "purified", it is meant, when referring to a peptide or DNA or 
RNA sequence, that the indicated molecule is present in the substantial absence of other 
biological macromolecules, such as other proteins. The term "purified" as used herein 
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preferably means at least 80% by dry weight, more preferably in the range of 95-99% by 
weight, and most preferably at least 99.8% by weight, of biological macromolecules of the 
same type present (but water, buffers, and other small molecules, especially molecules 
having a molecular weight of less than 5000, can be present). The term "pure" as used herein 
5 preferably has the same numerical limits as "purified" immediately above. "Isolated" and 
"purified" do not encompass either natural materials in their native state or natural materials 
that have been separated into components (e.g., in an acrylamide gel) but not obtained either 
as pure (e.g. lacking contaminating proteins, or chromatography reagents such as denaturing 
agents and polymers, e.g. acrylamide or agarose) substances or solutions. In preferred 
10 embodiments, purified hedgehog preparations will lack any contaminating proteins from the 
same animal from that hedgehog is normally produced, as can be accomplished by 
recombinant expression of, for example, a human hedgehog protein in a non-human cell. 

As described above for recombinant polypeptides, isolated hh polypeptides can 
include all or a portion of the amino acid sequences represented in SEQ ID No:8, SEQ ID 
1 5 No:9, SEQ ID No: 1 0, SEQ ID No: 1 1 , SEQ ID No: 12, SEQ ID No: 1 3 or SEQ ID No: 14, or a 
f*j homologous sequence thereto. Preferred fragments of the subject hedgehog proteins 

jr correspond to the N-terminal and C-terminal proteolytic fragments of the mature protein (see, 

Pi for instance, Examples 6 and 9). Bioactive fragments of hedgehog polypeptides are 

% described in great detail in USSN 08/435,093, filed May 4, 1995, herein incorporated by 

a 20 reference. 



ICS? 

Hi 



Isolated peptidyl portions of hedgehog proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid encoding such 
peptides. In addition, fragments can be chemically synthesized using techniques known in 
the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a 
25 hedgehog polypeptide of the present invention may be arbitrarily divided into fragments of 
desired length with no overlap of the fragments, or preferably divided into overlapping 
fragments of a desired length. The fragments can be produced (recombinantly or by chemical 
synthesis) and tested to identify those peptidyl fragments which can function as either 
agonists or antagonists of a wild-type (e.g., "authentic") hedgehog protein. 

30 The recombinant hedgehog polypeptides of the present invention also include 

homologs of the authentic hedgehog proteins, such as versions of those protein which are 
resistant to proteolytic cleavage, as for example, due to mutations which alter potential 
cleavage sequences or which inactivate an enzymatic activity associated with the protein. 
Hedgehog homologs of the present invention also include proteins which have been post- 
35 translationally modified in a manner different than the authentic protein. Exemplary 
derivatives of vertebrate hedgehog proteins include polypeptides which lack N-glycosylation 
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sites (e.g. to produce an unglycosylated protein), or which lack N-terminal and/or C-tenninal 
sequences. 

Modification of the structure of the subject vertebrate hh polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life 
and resistance to proteolytic degradation in vivo). Such modified peptides, when designed to 
retain at least one activity of the naturally-occurring form of the protein, are considered 
functional equivalents of the hedgehog polypeptides described in more detail herein. Such 
modified peptides can be produced, for instance, by amino acid substitution, deletion, or 
addition. 

For example, it is reasonable to expect that an isolated replacement of a leucine with 
an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
replacement of an amino acid with a structurally related amino acid (i.e. isosteric and/or 
isoelectric mutations) will not have a major effect on the biological activity of the resulting 
molecule. Conservative replacements are those that take place within a family of amino acids 
that are related in their side chains. Genetically encoded amino acids are can be divided into 
four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) 
nonpolar = alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly 
as aromatic amino acids. In similar fashion, the amino acid repertoire can be grouped as (1) 
acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = glycine, 
alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine optionally be 
grouped separately as aliphatic-hydroxyl; (4) aromatic - phenylalanine, tyrosine, tryptophan; 
(5) amide = asparagine, glutamine; and (6) sulfur -containing = cysteine and methionine, 
(see, for example, Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 1981). 
Whether a change in the amino acid sequence of a peptide results in a functional hedgehog 
homolog (e.g. functional in the sense that it acts to mimic or antagonize the wild-type form) 
can be readily determined by assessing the ability of the variant peptide to produce a response 
in cells in a fashion similar to the wild-type protein, or competitively inhibit such a response. 
Polypeptides in which more than one replacement has taken place can readily be tested in the 
same manner. 

This invention further contemplates a method for generating sets of combinatorial 
mutants of the subject hedgehog proteins as well as truncation mutants, and is especially 
useful for identifying potential variant sequences (e.g. homologs) that are functional in 
binding to a receptor for hedgehog proteins. The purpose of screening such combinatorial 
libraries is to generate, for example, novel hh homologs which can act as either agonists or 
antagonist, or alternatively, possess novel activities all together. To illustrate, hedgehog 
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homologs can be engineered by the present method to provide more efficient binding to a 
cognate receptor, yet still retain at least a portion of an activity associated with hh. Thus, 
combinatorially-derived homologs can be generated to have an increased potency relative to a 
naturally occurring form of the protein. Likewise, hedgehog homologs can be generated by 
the present combinatorial approach to act as antagonists, in that they are able to mimic, for 
example, binding to other extracellular matrix components (such as receptors), yet not induce 
any biological response, thereby inhibiting the action of authentic hedgehog or hedgehog 
agonists. Moreover, manipulation of certain domains of hh by the present method can 
provide domains more suitable for use in fusion proteins, such as one that incorporates 
portions of other proteins which are derived from the extracellular matrix and/or which bind 
extracellular matrix components. 

In one aspect of this method, the amino acid sequences for a population of hedgehog 
homologs or other related proteins are aligned, preferably to promote the highest homology 
possible. Such a population of variants can include, for example, hh homologs from one or 
more species. Amino acids which appear at each position of the aligned sequences are 
selected to create a degenerate set of combinatorial sequences. In a preferred embodiment, 
the variegated library of hedgehog variants is generated by combinatorial mutagenesis at the 
nucleic acid level, and is encoded by a variegated gene library. For instance, a mixture of 
synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the 
degenerate set of potential hh sequences are expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g. for phage display) containing the set of hh 
sequences therein. 

As illustrated in Figure 5 A, to analyze the sequences of a population of variants, the 
amino acid sequences of interest can be aligned relative to sequence homology. The presence 
or absence of amino acids from an aligned sequence of a particular variant is relative to a 
chosen consensus length of a reference sequence, which can be real or artificial. In order to 
maintain the highest homology in alignment of sequences, deletions in the sequence of a 
variant relative to the reference sequence can be represented by an amino acid space (• or *), 
while insertional mutations in the variant relative to the reference sequence can be 
disregarded and left out of the sequence of the variant when aligned. For instance, Figure 5A 
includes the alignment of several cloned forms of hh from different species. Analysis of the 
alignment of the hh clones shown in Figure 5 A can give rise to the generation of a degenerate 
library of polypeptides comprising potential hh sequences. 

In an illustrative embodiment, alignment of exons 1, 2 and a portion of exon 3 
encoded sequences (e.g. the N-terminal approximately 221 residues of the mature protein) of 
each of the Shh clones produces a degenerate set of Shh polypeptides represented by the 
general formula: 
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C-G-P-G-R-G-X(l) -G -X{2) -R-R-H-P-K-K-L-T-P-L-A- Y-K-Q-F- I-P-N-V- 
A-E-K-T-L-G-A-S-G-R-Y-E-G-K-I-X<3) -R-N-S-E-R-F-K-E-L-T-P-N-Y-N- 
P-D-I-I-F-K-D-E-E-N-T-G-A-D-R-L-M-T-Q-R-C-K-D-K-L-N-X(4) -L-A-I- 
S-V-M-N-X (5) -W-P-G-V-X{6) -L-R-V-T-E-G-W-D-E-D-G-H-H-X (7) -B-E-S- 
L-H-Y-E-G-R-A-V-D-I-T-T-S-D-R-D-X (8) -S-K-Y-G -X (9) -L-X (10) -R-L- 
A-V-E-A-G-F-D-W-V-Y-Y-E-S-K-A-H-I-H-C-S-V-K-A-E-N-S-V-A-A-K-S- 
G-G-C-F-P-G-S-A-X(ll) -V-X(12) -L-X(13) -X(14) -G-G-X(15) -K-X- (16) - 
V-K-D-L-X{17) -P-G-D-X (18) -V-L-A-A-D-X (19) -X(20) -G-X(21) -L- 
X(22) -X(23) -S-D-F-X(24) -X(25) -F-X(26) -D-R (SEQ ID No: 40) , 

wherein each of the degenerate positions "X M can be an amino acid which occurs in that 
position in one of the human, mouse, chicken or zebrafish Shh clones, or, to expand the 
library, each X can also be selected from amongst amino acid residue which would be 
conservative substitutions for the amino acids which appear naturally in each of those 
positions. For instance, Xaa(l) represents Gly, Ala, Val, Leu, lie, Phe, Tyr or Trp ; Xaa(2) 
represents Arg, His or Lys; Xaa(3) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(4) 
represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(5) represents Lys, Arg, His, Asn or Gin; 
Xaa(6) represents Lys, Arg or His; Xaa(7) represents Ser, Thr, Tyr, Trp or Phe; Xaa(8) 
represents Lys, Arg or His; Xaa(9) represents Met, Cys, Ser or Thr; Xaa(10) represents Gly, 
Ala, Val, Leu, He, Ser or Thr, Xaa(ll) represents Leu, Val, Met, Thr or Ser; Xaa(12) 
represents His, Phe, Tyr, Ser, Thr, Met or Cys; Xaa(13) represents Gin, Asn, Glu, or Asp; 
Xaa(14) represents His, Phe, Tyr, Thr, Gin, Asn, Glu or Asp; Xaa(15) represents Gin, Asn, 
Glu, Asp, Thr, Ser, Met or Cys; Xaa(16) represents Ala, Gly, Cys, Leu, Val or Met; Xaa(17) 
represents Arg, Lys, Met, He, Asn, Asp, Glu, Gin, Ser, Thr or Cys; Xaa(18) represents Arg, 
Lys, Met or lie; Xaa(19) represents Ala, Gly, Cys, Asp, Glu, Gin, Asn, Ser, Thr or Met; 
Xaa(20) represents Ala, Gly, Cys, Asp, Asn, Glu or Gin; Xaa(21) represents Arg, Lys, Met, 
He, Asn, Asp, Glu or Gin; Xaa(22) represent Leu, Val, Met or He; Xaa(23) represents Phe, 
Tyr, Thr, His or Trp; Xaa(24) represents He, Val, Leu or Met; .Xaa(25) represents Met, Cys, 
lie, Leu, Val, Thr or Ser; Xaa(26) represents Leu, Val, Met, Thr or Ser. In an even more 
expansive library, each X can be selected from any amino acid. 

In similar fashion, alignment of each of the human, mouse, chicken and zebrafish 
hedgehog clones (Figure 5B), can provide a degenerate polypeptide sequence represented by 
the general formula: 

C-G-P-G-R-G-X(l) -X(2) -X(3) -R-R-X(4) -X(5) -X(6) -P-K-X{7) -L-X (8) - 
P-L-X(9) -Y-K-Q-F-X(IO) -P-X(ll) -X(12) -X {13 ) -E-X (14 ) -T-L-G-A-S-G- 
X(15) -X(16) -E-G-X(17) -X(18) -X(19) -R-X(20) -S-E-R-F-X (21) -X(22) - 
L-T-P-N-Y-N-P-D-I-I-F-K-D-E-B-N -X(23) -G-A-D-R-L-M- T-X (24) -R-C- 
K-X(25) -X(26) -X(27) -N-X(28> -L-A- I-S-V-M-N-X (29) -W-P-G-V-X (30) - 
L-R-V-T-E-G-X(31) -D-E-D-G-H-H-X (32) -X(33) -X(34) -S-L-H-Y-E-G-R- 
A-X(35> -D-I-T-T-S-D-R-D-X (36) -X(37) -K-Y-G-X(38) -L-X (39) -R-L-A- 
V-B-A-G-F-D-W-V-Y-Y-E-S-X(40) -X(41) -H-X(42) -H-X{43) -S-V-K-X(44) 
-X(45) (SEQ ID No: 41) , 
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wherein, as above, each of the degenerate positions "X" can be an amino acid which occurs in 
a corresponding position in one of the wild-type clones, and may also include amino acid 
residue which would be conservative substitutions, or each X can be any amino acid residue. 
In an exemplary embodiment, Xaa(l) represents Gly, Ala, Val, Leu, He, Pro, Phe or Tyr; 
5 Xaa(2) represents Gly, Ala, Val, Leu or He; Xaa(3) represents Gly, Ala, Val, Leu, lie, Lys, 
His or Arg; Xaa(4) represents Lys, Arg or His; Xaa(5) represents Phe, Trp, Tyr or an amino 
acid gap; Xaa(6) represents Gly, Ala, Val, Leu, lie or an amino acid gap; Xaa(7) represents 
Asn, Gin, His, Arg or Lys; Xaa(8) represents Gly, Ala, Val, Leu, lie, Ser or Thr; Xaa(9) 
represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(10) represents Gly, Ala, Val, Leu, He, Ser 
10 or Thr; Xaa(l 1) represents Ser, Thr, Gin or Asn; Xaa(12) represents Met, Cys, Gly, Ala, Val, 
Leu, He, Ser or Thr, Xaa(13) represents Gly, Ala, Val, Leu, He or Pro; Xaa(14) represents 
Arg, His or Lys; Xaa(15) represents Gly, Ala, Val, Leu, He, Pro, Arg, His or Lys; Xaa(16) 
represents Gly, Ala, Val, Leu, He, Phe or Tyr; Xaa(17) represents Arg, His or Lys; Xaa(18) 
represents Gly, Ala, Val, Leu, He, Ser or Thr, Xaa(19) represents Thr or Ser; Xaa(20) 
P 15 represents Gly, Ala, Val, Leu, He, Asn or Gin; Xaa(21) represents Arg, His or Lys; Xaa(22) 
p represents Asp or Glu; Xaa(23) represents Ser or Thr; Xaa(24) represents Glu, Asp, Gin or 

SI Asn; Xaa(25) represents Glu or Asp; Xaa(26) represents Arg, His or Lys; Xaa(27) represents 

jj" Gly, Ala, Val, Leu or He; Xaa(28) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(29) 

p represents Met, Cys, Gin, Asn, Arg, Lys or His; Xaa(30) represents Arg, His or Lys; Xaa(3 1) 

*3 20 represents Trp, Phe, Tyr, Arg, His or Lys; Xaa(32) represents Gly, Ala, Val, Leu, He, Ser, 
p Thr, Tyr or Phe; Xaa(33) represents Gin, Asn, Asp or Glu; Xaa(34) represents Asp or Glu; 

SI Xaa(35) represents Gly, Ala, Val, Leu, or He; Xaa(36) represents Arg, His or Lys; Xaa(37) 

jy represents Asn, Gin, Thr or Ser, Xaa(38) represents Gly, Ala, Val, Leu, He, Ser, Thr, Met or 

Cys; Xaa(39) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(40) represents Arg, His or 
^ 25 Lys; Xaa(41) represents Asn, Gin, Gly, Ala, Val, Leu or He; Xaa(42) represents Gly, Ala, 
Val, Leu or He; Xaa(43) represents Gly, Ala, Val, Leu, lie, Ser, Thr or Cys; Xaa(44) 
represents Gly, Ala, Val, Leu, He, Thr or Ser; and Xaa(45) represents Asp or Glu. 

There are many, ways by which the library of potential hh homologs can be generated 
from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene 

30 sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then 
ligated into an appropriate expression vector. The purpose of a degenerate set of genes is to 
provide, in one mixture, all of the sequences encoding the desired set of potential hh 
sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for 
example, Narang, SA (1983) Tetrahedron 39:3; Itakura et aL (1981) Recombinant DNA, 

35 Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273- 
289; Itakura et al. (1984) Amu. Rev, Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed 
in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 
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249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 
404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409, 
5,198,346, and 5,096,815). 

A wide range of techniques are known in the art for screening gene products of 
5 combinatorial libraries made by point mutations, and for screening cDNA libraries for gene 
products having a certain property. Such techniques will be generally adaptable for rapid 
screening of the gene libraries generated by the combinatorial mutagenesis of hedgehog 
homologs. The most widely used techniques for screening large gene libraries typically 
comprises cloning the gene library into replicable expression vectors, transforming 

10 appropriate cells with the resulting library of vectors, and expressing the combinatorial genes 
under conditions in which detection of a desired activity facilitates relatively easy isolation of 
the vector encoding the gene whose product was detected. Each of the illustrative assays 
described below are amenable to high through-put analysis as necessary to screen large 
numbers of degenerate hedgehog sequences created by combinatorial mutagenesis 

15 techniques. 

In one embodiment, the combinatorial library is designed to be secreted (e.g. the 
polypeptides of the library all include a signal sequence but no transmembrane or 
cytoplasmic domains), and is used to transfect a eukaryotic cell that can be co-cultured with 
embryonic cells. A functional hedgehog protein secreted by the cells expressing the 

20 combinatorial library will diffuse to neighboring embryonic cells and induce a particular 
biological response, such as to illustrate, neuronal differentiation. Using antibodies directed 
to epitopes of particular neuronal cells (e.g. Islet- 1 or Pax-1), the pattern of detection of 
neuronal induction will resemble a gradient function, and will allow the isolation (generally 
after several repetitive rounds of selection) of cells producing active hedgehog homologs. 

25 Likewise, hh antagonists can be selected in similar fashion by the ability of the cell producing 
a functional antagonist to protect neighboring cells from the effect of wild-type hedgehog 
added to the culture media. 

To illustrate, target cells are cultured in 24-weIl microtitre plates. Other eukaryotic 
cells are transfected with the combinatorial hh gene library and cultured in cell culture inserts 

30 (e.g. Collaborative Biomedical Products, Catalog #40446) that are able to fit into the wells of 
the microtitre plate. The cell culture inserts are placed in the wells such that recombinant hh 
homologs secreted by the cells in the insert can diffuse through the porous bottom of the 
insert and contact the target cells in the microtitre plate wells. After a period of time 
sufficient for functional forms of a hedgehog protein to produce a measurable response in the 

35 target cells, the inserts are removed and the effect of the variant hedgehog proteins on the 
target cells determined. For example, where the target cell is a neural crest cell and the 
activity desired from the hh homolog is the induction of neuronal differentiation, then 
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fluorescently-labeled antibodies specific for Islet- 1 or other neuronal markers can be used to 
score for induction in the target cells as indicative of a functional hh in that well. Cells from 
the inserts corresponding to wells which score positive for activity can be split and re- 
cultured on several inserts, the process being repeated until the active clones are identified. 

5 In yet another screening assay, the candidate hedgehog gene products are displayed on 

the surface of a cell or viral particle, and the ability of particular cells or viral particles to 
associate with a hedgehog-bm<\m% moiety (such as an hedgehog receptor or a ligand which 
binds the hedgehog protein) via this gene product is detected in a "panning assay". Such 
panning steps can be carried out on cells cultured from embryos. For instance, the gene 
10 library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the 
resulting fusion protein detected by panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) 
Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). In a similar 
fashion, fluorescently labeled molecules which bind hh can be used to score for potentially 
p functional hh homologs. Cells can be visually inspected and separated under a fluorescence 

60 15 microscope, or, where the morphology of the cell permits, separated by a fluorescence- 
J activated cell sorter. 

•a 

Hi In an alternate embodiment, the gene library is expressed as a fusion protein on the 

Q surface of a viral particle. For instance, in the filamentous phage system, foreign peptide 

sequences can be expressed on the surface of infectious phage, thereby conferring two 
p 20 significant benefits. First, since these phage can be applied to affinity matrices at very high 
N concentrations, large number of phage can be screened at one time. Second, since each 

jy infectious phage displays the combinatorial gene product on its surface, if a particular phage 

\Q is recovered from an affinity matrix in low yield, the phage can be amplified by another 

P* 1 round of infection. The group of almost identical Rcoli filamentous phages Ml 3, fd, and fl 

25 are most often used in phage display libraries, as either of the phage gill or gVIII coat 
proteins can be used to generate fusion proteins without disrupting the ultimate packaging of 
the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT 
publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Grififths et 
al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. 
30 (1992) PNAS 89:4457-4461). 

In an illustrative embodiment, the recombinant phage antibody system (RPAS, 
Pharamacia Catalog number 27-9400-01) can be easily modified for use in expressing and 
screening hh combinatorial libraries. For instance, the pCANTAB 5 phagemid of the RPAS 
kit contains the gene which encodes the phage gill coat protein. The hh combinatorial gene 
35 library can be cloned into the phagemid adjacent to the gin signal sequence such that it will 
be expressed as a gill fusion protein. After ligation, the phagemid is used to transform 
competent E, coli TGI cells. Transformed cells are subsequently infected with M13K07 
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helper phage to rescue the phagemid and its candidate hh gene insert. The resulting 
recombinant phage contain phagemid DNA encoding a specific candidate hh, and display one 
or more copies of the corresponding fusion coat protein. The phage-displayed candidate 
hedgehog proteins which are capable of binding an hh receptor are selected or enriched by 
panning. For instance, the phage library can be applied to cultured embryonic cells and 
unbound phage washed away from the cells. The bound phage is then isolated, and if the 
recombinant phage express at least one copy of the wild type gill coat protein, they will retain 
their ability to infect E. colu Thus, successive rounds of reinfection of E. coli, and panning 
will greatly enrich for hh homologs, which can then be screened for further biological 
activities in order to differentiate agonists and antagonists. 

Combinatorial mutagenesis has a potential to generate very large libraries of mutant 
proteins, e.g., in the order of 10 26 molecules. Combinatorial libraries of this size may be 
technically challenging to screen even with high throughput screening assays such as phage 
display. To overcome this problem, a new technique has been developed recently, recrusive 
ensembel mutagenesis (REM), which allows one to avoid the very high proportion of non- 
functional proteins in a random library and simply enhances the frequency of functional 
proteins, thus decreasing the complexity required to achieve a useful sampling of sequence 
space. REM is an algorithm which enhances the frequency of functional mutants in a library 
when an appropriate selection or screening method is employed (Arkin and Yourvan, 1992, 
PNAS USA 89:7811-7815; Yourvan et al., 1992, Parallel Problem Solving from Nature, 2., In 
Maenner and Manderick, eds., Elsevir Publishing Co., Amsterdam, pp. 401-410; Delgrave et 
al., 1993, Protein Engineering 6(3):327-331). 

The invention also provides for reduction of the vertebrate hh protein to generate 
mimetics, e.g. peptide or non-peptide agents, which are able to disrupt binding of a vertebrate 
hh polypeptide of the present invention with an hh receptor. Thus, such mutagenic 
techniques as described above are also useful to map the determinants of the hedgehog 
proteins which participate in protein-protein interactions involved in, for example, binding of 
the subject vertebrate hh polypeptide to other extracellular matrix components. To illustrate, 
the critical residues of a subject hh polypeptide or hh ligand which are involved in molecular 
recognition of an hh receptor can be determined and used to generate hedgehog-derived 
peptidomimetics which competitively inhibit binding of the authentic hedgehog protein with 
that moiety. By employing, for example, scanning mutagenesis to map the amino acid 
residues of each of the subject hedgehog proteins which are involved in binding other 
extracellular proteins, peptidomimetic compounds can be generated which mimic those 
residues of the hedgehog protein which facilitate the interaction. Such mimetics may then be 
used to interfere with the normal function of a hedgehog protein. For instance, non- 
hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., 
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see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: 
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher; Leiden, Netherlands, 1988), 
substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher Leiden, Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and 
Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. 
Rockland, IL, 1985), p-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and 
Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and p-aminoalcohols (Gordon et al. 
(1985) Biochem Biophys Res Commit 126:4 19; and Dann et al. (1986) Biochem Biophys Res 
Commun 134:71). 

Another aspect of the invention pertains to an antibody specifically reactive with a 
vertebrate hedgehog protein. For example, by using immunogens derived from hedgehog 
protein, e.g. based on the cDNA sequences, anti-protein/anti-peptide antisera or monoclonal 
antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory 
Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal, such as a 
mouse, a hamster or rabbit can be immunized with an immunogenic form of the peptide (e.g., 
a vertebrate hh polypeptide or an antigenic fragment which is capable of eliciting an antibody 
response). Techniques for conferring immunogenicity on a protein or peptide include 
conjugation to carriers or other techniques well known in the art. An immunogenic portion 
of a hedgehog protein can be administered in the presence of adjuvant. The progress of 
immunization can be monitored by detection of antibody titers in plasma or serum. Standard 
ELISA or other immunoassays can be used with the immunogen as antigen to assess the 
levels of antibodies. In a preferred embodiment, the subject antibodies are immunospecific 
for antigenic determinants of a hedgehog protein of a vertebrate organism, such as a 
mammal, e.g. antigenic determinants of a protein represented by SEQ ID Nos:8-14 or a 
closely related homolog (e.g. at least 85% homologous, preferably at least 90% homologous, 
and more preferably at least 95% homologous). In yet a further preferred embodiment of the 
present invention, in order to provide, for example, antibodies which are immuno-selective 
for discrete hedgehog homologs, e.g. Shh versus Dhh versus Ihh, the anti-hh polypeptide 
antibodies do not substantially cross react (i.e. does not react specifically) with a protein 
which is, for example, less than 85% homologous to any of SEQ ID Nos:8-14; e.g., less than 
95% homologous with one of SEQ ED Nos:8-14; e.g., less than 98-99% homologous with 
one of SEQ ID Nos:8-14. By "not substantially cross react", it is meant that the antibody has 
a binding affinity for a non-homologous protein which is at least one order of magnitude, 
more preferably at least 2 orders of magnitude, and even more preferably at least 3 orders of 
magnitude less than the binding affinity of the antibody for one or more of the proteins of 
SEQIDNos:8-14. 
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Following immunization of an animal with an antigenic preparation of a hedgehog 
protein, anti-M antisera can be obtained and, if desired, polyclonal anti-AA antibodies isolated 
from the serum. To produce monoclonal antibodies, antibody-producing cells (lymphocytes) 
can be harvested from an immunized animal and fused by standard somatic cell fusion 
procedures with immortalizing cells such as myeloma cells to yield hybridoma cells. Such 
techniques are well known in the art, an include, for example, the hybridoma technique 
(originally developed by Kohler and Milstein, (1975) Nature, 256: 495-497), the human B 
cell hybridoma technique (Kozbar et al., (1983) Immunology Today, 4: 72), and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole et al, (1985) 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. pp. 77-96). Hybridoma cells 
can be screened immunochemically for production of antibodies specifically reactive with a 
vertebrate hh polypeptide of the present invention and monoclonal antibodies isolated from a 
culture comprising such hybridoma cells. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with one of the subject vertebrate hh polypeptides. Antibodies can 
be fragmented using conventional techniques and the fragments screened for utility in the 
same manner as described above for whole antibodies. For example, F(ab)2 fragments can 
be generated by treating antibody with pepsin. The resulting F(ab)2 fragment can be treated 
to reduce disulfide bridges to produce Fab fragments. The antibody of the present invention 
is further intended to include bispecific and chimeric molecules having affinity for a 
hedgehog protein conferred by at least one CDR region of the antibody. 

Both monoclonal and polyclonal antibodies (Ab) directed against authentic hedgehog 
polypeptides, or hedgehog variants, and antibody fragments such as Fab and F(ab)2> can be 
used to block the action of one or more hedgehog proteins and allow the study of the role of 
these proteins in, for example, embryogenesis and/or maintenance of differential tissue. For 
example, purified monoclonal Abs can be injected directly into the limb buds of chick or 
mouse embryos. It is demonstrated in the examples below that hh is expressed in the limb 
buds of, for example, day 10.5 embryos. Thus, the use of anti-M Abs during this 
developmental stage can allow assessment of the effect of hh on the formation of limbs in 
vivo. In a similar approach, hybridomas producing anti-hh monoclonal Abs, or 
biodegradable gels in which anti-hh Abs are suspended, can be implanted at a site proximal 
or within the area at which hh action is intended to be blocked. Experiments of this nature 
can aid in deciphering the role of this and other factors that may be involved in limb 
patterning and tissue formation. 

Antibodies which specifically bind hedgehog epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of each of the subject hh polypeptides. Anti-hedgehog antibodies can 
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be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate 
hedgehog protein levels in tissue as part of a clinical testing procedure. For instance, such 
measurements can be useful in predictive valuations of the onset or progression of 
neurological disorders, such as those marked by denervation-like or disuse-like symptoms. 
5 Likewise, the ability to monitor hh levels in an individual can allow determination of the 
efficacy of a given treatment regimen for an individual afflicted with such a disorder. The 
level of hh polypeptides may be measured in bodily fluid, such as in samples of cerebral 
spinal fluid or amniotic fluid, or can be measured in tissue, such as produced by biopsy. 
Diagnostic assays using anti-AA antibodies can include, for example, immunoassays designed 
10 to aid in early diagnosis of a neurodegenerative disorder, particularly ones which are manifest 
at birth. Diagnostic assays using anti-M polypeptide antibodies can also include 
immunoassays designed to aid in early diagnosis and phenotyping of a differentiative 
disorder, as well as neoplastic or hyperplastic disorders. 

P Another application of anti-/*A antibodies of the present invention is in the 

00 15 immunological screening of cDNA libraries constructed in expression vectors such as Xgtl 1, 
Xgtl8-23, XZAP, and A.ORF8. Messenger libraries of this type, having coding sequences 
inserted in the correct reading frame and orientation, can produce fusion proteins. For 
^ instance, Xgtll will produce fusion proteins whose amino termini consist of B-galactosidase 

yg amino acid sequences and whose carboxy termini consist of a foreign polypeptide. Antigenic 

* 20 epitopes of an hh protein, e.g. other orthologs of a particular hedgehog protein or other 

q 

■isj homologs from the same species, can then be detected with antibodies, as, for example, 

p reacting nitrocellulose filters lifted from infected plates with smli-hh antibodies. Positive 
phage detected by this assay can then be isolated from the infected plate. Thus, the presence 

(ft of hedgehog homologs can be detected and cloned from other animals, as can alternate 

25 isoforms (including splicing variants) from humans. 

Moreover, the nucleotide sequences determined from the cloning of hh genes from 
vertebrate organisms will further allow for the generation of probes and primers designed for 
use in identifying and/or cloning hedgehog homologs in other cell types, e.g. from other 
tissues, as well as hh homologs from other vertebrate organisms. For instance, the present 

30 invention also provides a probe/primer comprising a substantially purified oligonucleotide, 
which oligonucleotide comprises a region of nucleotide sequence that hybridizes under 
stringent conditions to at least 10 consecutive nucleotides of sense or anti-sense sequence 
selected from the group consisting of SEQ ID No:l, SEQ ID No:2, SEQ ID No:3, SEQ ID 
No:4, SEQ ID No:5, SEQ ID No:6 and SEQ ID No:7, or naturally occurring mutants thereof. 

35 For instance, primers based on the nucleic acid represented in SEQ ID Nos:l-7 can be used in 
PCR reactions to clone hedgehog homologs. Likewise, probes based on the subject hedgehog 
sequences can be used to detect transcripts or genomic sequences encoding the same or 
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homologous proteins. In preferred embodiments, the probe further comprises a label group 
attached thereto and able to be detected, e.g. the label group is selected from the group 
consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

Such probes can also be used as a part of a diagnostic test kit for identifying cells or 
tissue which misexpress a hedgehog protein, such as by measuring a level of a hedgehog 
encoding nucleic acid in a sample of cells from a patient; e.g. detecting hh mRNA levels or 
determining whether a genomic hh gene has been mutated or deleted. 

To illustrate, nucleotide probes can be generated from the subject hedgehog genes 
which facilitate histological screening of intact tissue and tissue samples for the presence (or 
absence) of hedgehog-tncodmg transcripts. Similar to the diagnostic uses of anti-hedgehog 
antibodies, the use of probes directed to hh messages, or to genomic hh sequences, can be 
used for both predictive and therapeutic evaluation of allelic mutations which might be 
manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell growth) or 
abnormal differentiation of tissue. Used in conjunction with immunoassays as described 
above, the oligonucleotide probes can help facilitate the determination of the molecular basis 
for a developmental disorder which may involve some abnormality associated with 
expression (or lack thereof) of a hedgehog protein. For instance, variation in polypeptide 
synthesis can be differentiated from a mutation in a coding sequence. 

Accordingly, the present method provides a method for determining if a subject is at 
risk for a disorder characterized by aberrant control of differentiation or unwanted cell 
proliferation. For instance, the subject assay can be used in the screening and diagnosis of 
genetic and acquired disorders which involve alteration in one or more of the hedgehog 
genes. In preferred embodiments, the subject method can be generally characterized as 
comprising: detecting, in a tissue sample of the subject (e.g. a human patient), the presence 
or absence of a genetic lesion characterized by at least one of (i) a mutation of a gene 
encoding a hedgehog protein or (ii) the mis-expression of a hedgehog gene. To illustrate, 
such genetic lesions can be detected by ascertaining the existence of at least one of (i) a 
deletion of one or more nucleotides from a hedgehog gene, (ii) an addition of one or more 
nucleotides to a hedgehog gene, (iii) a substitution of one or more nucleotides of a hedgehog 
gene, (iv) a gross chromosomal rearrangement of a hedgehog gene, (v) a gross alteration in 
the level of a messenger RNA transcript of an hh gene, (vi) the presence of a non-wild type 
splicing pattern of a messenger RNA transcript of a vertebrate hh gene, and (vii) a non-wild 
type level of a hedgehog protein. In one aspect of the invention there is provided a 
probe/primer comprising an oligonucleotide containing a region of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence selected from the group 
consisting of SEQ ID Nos:l-7, or naturally occurring mutants thereof, or 5' or 3' flanking 
sequences or intronic sequences naturally associated with a vertebrate hh gene. The probe is 
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exposed to nucleic acid of a tissue sample; and the hybridization of the probe to the sample 
nucleic acid is detected. In certain embodiments, detection of the lesion comprises utilizing 
the probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent No: 4,683,195 
and 4,683,202) or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et 
al. (1988) Science, 241:1077-1080; and NaKazawa et aL (1944) PNAS 91:360-364) the later 
of which can be particularly useful for detecting point mutations in hedgehog genes. 
Alternatively, immunoassays can be employed to determine the level of hh proteins, either 
soluble or membrane bound. 

Yet another diagnostic screen employs a source of hedgehog protein directly. As 
described herein, hedgehog proteins of the present invention are involved in the induction of 
differentiation. Accordingly, the pathology of certain differentiative and/or proliferative 
disorders can be marked by loss of hedgehog sensitivity by the afflicted tissue. 
Consequently, the response of a tissue or cell sample to an inductive amount of a hedgehog 
protein can be used to detect and characterize certain cellular transformations and 
degenerative conditions. For instance, tissue/cell samples from a patient can be treated with 
a hedgehog agonist and the response of the tissue to the treatment determined. Response 
can be qualified and/or quantified, for example, on the basis of phenotypic change as result 
of hedgehog induction. For example, expression of gene products induced by hedgehog 
treatment can be scored for by immunoassay. The patched protein, for example, is 
upregulated in drosophila in response to Dros-HH, and, in light of the findings herein, a 
presumed vertebrate homolog will similarly be upregulated. Thus, detection of patched 
expression on the cells of the patient sample can permit detection of tissue that is not 
hedgehog-responsive. Likewise, scoring for other phenotypic markers provides a means for 
determining the response to hedgehog. 

Furthermore, by making available purified and recombinant hedgehog polypeptides, 
the present invention facilitates the development of assays which can be used to screen for 
drugs, including hedgehog homologs, which are either agonists or antagonists of the normal 
cellular function of the subject hedgehog polypeptides, or of their role in the pathogenesis of 
cellular differentiation and/or proliferation and disorders related thereto. In one embodiment, 
the assay evaluates the ability of a compound to modulate binding between a hedgehog 
polypeptide and a hedgehog receptor. A variety of assay formats will suffice and, in light of 
the present inventions, will be comprehended by skilled artisan. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of compounds 
surveyed in a given period of time. Assays which are performed in cell-free systems, such as 
may be derived with purified or semi-purified proteins, are often preferred as "primary" 
screens in that they can be generated to permit rapid development and relatively easy 
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detection of an alteration in a molecular target which is mediated by a test compound. 
Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be 
generally ignored in the in vitro system, the assay instead being focused primarily on the 
effect of the drug on the molecular target as may be manifest in an alteration of binding 
5 affinity with receptor proteins. Accordingly, in an exemplary screening assay of the present 
invention, the compound of interest is contacted with a hedgehog receptor polypeptide which 
is ordinarily capable of binding a hedgehog protein. To the mixture of the compound and 
receptor is then added a composition containing a hedgehog polypeptide. Detection and 
quantification of receptor/hedgehog complexes provides a means for determining the 
10 compound's efficacy at inMbiting (or potentiating) complex formation between the receptor 
protein and the hedgehog polypeptide^ The efficacy of the compound can be assessed by 
generating dose response curves from data obtained using various concentrations of the test 
compound. Moreover, a control assay can also be performed to provide a baseline for 
comparison. In the control assay, isolated and purified hedgehog polypeptide is added to a 

P 15 composition containing the receptor protein, and the formation of receptor/hedgehog 

CO 

0 complex is quantitated in the absence of the test compound. 

Si 

^ In an illustrative embodiment, the polypeptide utilized as a hedgehog receptor can be 

V] generated from the drosophila patched protein or a vertebrate homolog thereof. As described 

y3 in USSN 08/356,060, genetic data in the fruit fly was consistent with the patched gene 

* 20 product being a receptor for hedgehog. In light of the ability of, for example, Shh to activate 
j*j the drosophila HH pathways in transgenic flies (see Example 4), we had earlier concluded 

Q that vertebrate hedgehog proteins are capable of binding to drosophila HH receptors, 

j| including the patched protein. 

*f 1 Accordingly, an exemplary screening assay includes all or a suitable portion of the 

25 patched protein which can be obtained from, for example, the human patched gene (SEQ ID 
No. 42) or other vertebrate sources (see GenBank Accession numbers U40074 for chicken 
patched and U46155 for mouse patched), as well as from drosophila (GenBank Accession 
number M28999) or other invertebrate sources. The patched protein can be provided in the 
screening assay as a whole protein, or alternatively as a fragment of the full length protein 
30 which binds to hh polypeptides, e.g., as one or both of the substantial extracellular domains 
(e.g. corresponding to residues Asnl20-Ser438 and/or Arg770-Trpl027 of the human 
patched protein). For instance, the patched protein can be provided in soluble form, as for 
example a preparation of one of the extracellular domains, or a preparation of both of the 
extracellular domains which are covalently connected by an unstructured linker (see, for 
35 example, Huston et al. (1988) PNAS 85:4879; and U.S. Patent No. 5,091,513). In other 
embodiments, the protein can be provided as part of a liposomal preparation or expressed on 
the surface of a cell. The patched protein can derived from a recombinant gene, e.g., being 
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ectopically expressed in a heterologous cell. For instance, the protein can be expressed on 
oocytes, mammalian cells (e.g., COS, CHO, 3T3 or the like), or yeast cell by standard 
recombinant DNA techniques. These recombinant cells can be used for receptor binding, 
signal transduction or gene expression assays. Example 12 illustrates a binding assay of 
5 human hedgehog to chick patched protein ectopically expressed in Xenopus laevis oocytes. 
Si As illustrated in that example, Shh binds to the patched protein in a selective, saturable, dose- 
| dependent manner, thus demonstrating that patched is a receptor for Shh. The patched 
protein can be provided as a glycosylated protein, or the glycosylation state of the protein can 
be altered by mutation of glycosylation sites (e.g., Asn 349, 875 or 940). 

10 Complex formation between the hedgehog polypeptide and a hedgehog receptor may 

be detected by a variety of techniques. For instance, modulation of the formation of 
complexes can be quantitated using, for example, detectably labelled proteins such as 
radiolabelled, fluorescently labelled, or enzymatically labelled hedgehog polypeptides, by 
p immunoassay, or by chromatographic detection. 

m 

p 15 Typically, for cell-free assays, it will be desirable to immobilize either the hedgehog 

Nj receptor or the hedgehog polypeptide to facilitate separation of receptor/ hedgehog complexes 

jjj from uncomplexed forms of one of the proteins, as well as to accommodate automation of the 

P assay. In one embodiment, a fusion protein can be provided which adds a domain that allows 

^ the protein to be bound to a matrix. For example, glutathione-S-transferase/receptor 

p 20 (GST/receptor) fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are then 
jy combined with the hedgehog polypeptide, e.g. an 35 S-labeled hedgehog polypeptide, and the 

y3 test compound and incubated under conditions conducive to complex formation, e.g. at 

v physiological conditions for salt and pH, though slightly more stringent conditions may be 

25 desired Following incubation, the beads are washed to remove any unbound hedgehog 
polypeptide, and the matrix bead-bound radiolabel determined directly (e.g. beads placed in 
scintillant), or in the supernatant after the receptor J hedgehog complexes are dissociated. 
Alternatively, the complexes can be dissociated from the bead, separated by SDS-PAGE gel, 
and the level of hedgehog polypeptide found in the bead fraction quantitated from the gel 
30 using standard electrophoretic techniques. 

Other techniques for immobilizing proteins on matrices are also available for use in 
the subject assay. For instance, soluble portions of the hedgehog receptor protein can be 
immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated 
receptor molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
35 techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, EL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with the hedgehog receptor but which do not interfere with 



-61- 



hedgehog binding can be derivatized to the wells of the plate, and the receptor trapped in the 
wells by antibody conjugation. As above, preparations of a hedgehog polypeptide and a test 
compound are incubated in the receptor-presenting wells of the plate, and the amount of 
receptor/hedgehog complex trapped in the well can be quantitated. Exemplary methods for 
5 detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the 
hedgehog polypeptide, or which are reactive with the receptor protein and compete for 
binding with the hedgehog polypeptide; as well as enzyme-linked assays which rely on 
detecting an enzymatic activity associated with the hedgehog polypeptide. In the instance of 
10 the latter, the enzyme can be chemically conjugated or provided as a fusion protein with the 
hedgehog polypeptide. To illustrate, the hedgehog polypeptide can be chemically cross- 
linked or genetically fused with alkaline phosphatase, and the amount of hedgehog 
polypeptide trapped in the complex can be assessed with a chromogenic substrate of the 
enzyme, e.g. paranitrophenylphosphate. Likewise, a fusion protein comprising the hedgehog 
0 15 polypeptide and glutathione-S-transferase can be provided, and complex formation 
§? quantitated by detecting the GST activity using l-chloro-2,4-dinitrobenzene (Habig et al 

(1974) J Biol Chem 249:71 30). 



W For processes which rely on immunodetection for quantitating one of the proteins 

J| trapped in the complex, antibodies against the protein, such as the anti-hedgehog antibodies 

* 20 described herein, can be used. Alternatively, the protein to be detected in the complex can be 
P "epitope tagged" in the form of a fusion protein which includes, in addition to the hedgehog 

Q polypeptide or hedgehog receptor sequence, a second polypeptide for which antibodies are 

^ readily available (e.g. from commercial sources). For instance, the GST fusion proteins 

w> described above can also be used for quantification of binding using antibodies against the 

25 GST moiety. Other useful epitope tags include myc-epitopes (e.g., see Ellison et al. (1991) J 
Biol Chem 266:21150-21157) which includes a 10-residue sequence from c-myc, as well as 
the pFLAG system (International Biotechnologies, Inc.) or the pEZZ-protein A system 
(Pharamacia, NJ). 

Where the desired portion of the hh receptor (or other hedgehog binding molecule) 
30 cannot be provided in soluble form, liposomal vesicles can be used to provide manipulatable 
and isolatable sources of the receptor. For example, both authentic and recombinant forms of 
the patched protein can be reconstituted in artificial lipid vesicles (e.g. phosphatidylcholine 
liposomes) or in cell membrane-derived vesicles (see, for example, Bear et al. (1992) Cell 
68:809-818; Newton et al. (1983) Biochemistry 22:6110-6117; and Reber et al. (1987) J Biol 
35 Chem 262:11369-11374). 

In addition to cell-free assays, such as described above, the readily available source of 
vertebrate hedgehog proteins provided by the present invention also facilitates the generation 
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of cell-based assays for identifying small molecule agonists/antagonists and the like. 
Analogous to the cell-based assays described above for screening combinatorial libraries, 
cells which are sensitive to hedgehog induction, e.g. patched-expressmg cells, can be 
contacted with a hedgehog protein and a test agent of interest, with the assay scoring for 
5 anything from simple binding to the cell to modulation in hedgehog inductive responses by 
the target cell in the presence and absence of the test agent. As with the cell-free assays, 
agents which produce a statistically significant change in hedgehog activities (either 
inhibition or potentiation) can be identified. 

In an illustrative embodiment, motor neuron progenitor cells, such as from neural 
10 plate explants, can be used as target cells. Treatment of such explanted cells with, for 
example, Shh causes the cells to differentiate into motor neurons. By detecting the co- 
expression of the LIM homeodomain protein Islet-1 (Thor et aL (1991) Neuron 7:881-889; 
Ericson et al. (1992) Science 256:1555-1560) and the immunoglobulin-like protein SCI 
(Tanaka et al. (1984) Dev Biol 106:26-37), the ability of a candidate agent to potentiate or 
§J 15 inhibit Shh induction of motor neuron differentiation can be measured. Additional illustrative 
F* examples of tissues responsive to Shh that can be used to identify candidate modulators of 

£ Shh activity include mesencephalic and bone tissues. Shh has been shown to induce the 

^ expression of the dopaminergic markers, tyrosine hydroxylase . and dopamine, in 

mesencephalc tissues (see Example 10). Mesencephalic cultures can be exposed to a 
20 candidate agent in the presence of Shh and assayed for changes in the expression of such 
jpj dopaminergic markers. 

f| Similarly, modulation of bone formation in response to hedgehog and the candidate 

agent can be assessed by alterations in mineral (hydroxyapatite) formation. For example, the 

^ test cells/tissue can be stained with Von Kossa and acid fuschin or toluidine blue (see 

25 Example 1 1). Alternatively, up- or down-regulation of transcription, such as the expression 
of homeobox genes (Hoxd genes), can be used as a dectable signal for the potentiation or 
inhibition of a hedgehog-induced signal. 

In addition, tissues responsive to a hedgehog protein acting in concert with other 
factors can be used to identify candidate modulators of hedgehog activity. For example, both 
30 the notochord and Sffi/have been shown to induce the expression of the sclerotomal marker 
Pax-1 in presomitic mesoderm explants (Fan and Tessier-Levigne (1994)). Using this assay, 
changes in the level of Pax-1 activity in the presence and absence of a candidate agent can be 
detected. 

In addition to characterizing ceils that naturally express the patched protein, cells 
35 which have been genetically engineered to ectopically express patched can be utilized for 
drug screening assays. As an example, cells which either express low levels or lack 
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expression of the patched protein, e.g. Xenopus laevis oocytes, COS cells or yeast cells, can 
be genetically modified using standard techniques to ectopically express the patched protein, 
(see Example 12). Other sources for cells which substantially lack expression include cells 
isolated from patients having heterozygous or homozygous mutations to the patched gene, as 
5 for example, cells isolated from certain patients with basal cell nevus syndrome or gorlin 
syndrome (see, for example, Johnson et al. (1996) Science 272:1668). As used herein, 
"substantially lack expression" of an endogenous patched protein refers to loss of expression, 
or expression of a mutant patched protein, which renders the cell at least one order of 
magnitude less sensitive to hedgehog signalling than the wild-type cell. Heterologous 
10 expression of a patched receptor can be carried out in such cells using expression constructs 
which are not sensitive to patched signalling, e.g., which do not use transcriptional regulatory 
sequence of the patched gene, in order to prevent confounding results which may otherwise 
occur by up regulation of the level of endogenous patched gene expression upon hedgehog 
stimulation. 

15 The resulting recombinant cells, e.g., which express a functional patched receptor, 

ff| can be utilized in receptor binding assays to identify agonist or anatagonsts of Shh binding. 

Binding assays can be performed using whole cells as described in Example 12. 
Furthermore, the recombinant cells of the present invention can be engineered to include 
other heterolgous genes encoding proteins involved in hedgehog-dependent siganl pathways. 
20 For example, the gene products of one or more of smoothened, costal-2 and/or fused can be 
co-expressed with patched in the reagent cell, with assays being sensitive to the functional 
reconstituion of the hedgehog signal transduction cascade. 

Alternatively, liposomal preparations using reconstituted patched protein can be 



p utilized. Patched protein purified from detergent extracts from both authentic and 

25 recombinant origins can be reconstituted in in artificial lipid vesicles (e.g. 
phosphatidylcholine liposomes) or in cell membrane-derived vesicles (see, for example, Bear 
et al. (1992) Cell 68:809-818; Newton et al. (1983) Biochemistry 22:6110-6117; and Reber et 
al. (1987) J Biol Chem 262:11369-11374). The lamellar structure and size of the resulting 
liposomes can be characterized using electron microscopy. External orientation of the 
30 patched protein in the reconstituted membranes can be demonstrated, for example, by 
immunoelectron microscopy. The hedgehog protein binding activity of liposomes containing 
patched and liposomes without the protein in the presence of candidate agents can be 
compared in order to identify potential modulators of the hedgehog-patched interaction. 

The hedgehog protein used in these cell-based assays can be provided as a purified 
35 source (natural or recombinant in origin), or in the form of cells/tissue which express the 
protein and which are co-cultured with the target cells. As in the cell-free assays, where 
simple binding (rather than induction) is the hedgehog activity scored for in the assay, the 
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protein can be labelled by any of the above-mentioned techniques, e.g., fluorescently, 
enzymatically or radioactively, or detected by immunoassay. 

In addition to binding studies, functional assays can be used to identified modulators, 
i.e., agonists or antagonists, of hedgehog protein activity. By detecting changes in 
5 intracellular signals, such as alterations in second messengers or gene expression, in cells 
contacted with a hedgehog protein and a test agent, candidate agonists and antagonists to 
hedgehog signaling can be identified. To illustrate, the intracellular signal that is transduced 
can be initiated by the specific interaction of the hh polypeptide with its cell surface receptor, 
e.g. patche d protein. In Drosophila, and presumtively in vertebrate cells as well, a number 

10 of gene products, including patched, the transcription factor cubitus interruptus (ci), the 
serine/threonine kinase fused (fu) and the gene products of costal-2, smoothened and 
suppressor of fused, have been implicated as putative components of hedgehog-dependent 
signal transduction pathwats. The recent cloning of vertebrate homologs of the drosophila 
genes suggests that the hedgehog signaling pathway is highly conserved from drosophila to 

15 vertebrate species. 

?] The interaction of a hedgehog protein with its receptor sets in motion a cascade 

*P involving the activation and inhibition of downstream effectors, the ultimate consequence of 

jlj which is, in some instances, a detectable change in the transcription or translation of a gene. 

Potential transcriptional targets of hedgehog signaling are the patched gene itself (Hidalgo 
%. 20 and Ingham, 1990 Development 110, 291-301; Marigo et al., 1996 ) and the vertebrate 
\l homologs of the drosophila cubitus interruptus gene, the GLI genes (Hui et al. (1994) Dev 

O Biol 162:402-413). Patched gene expression has been shown to be induced in cells of the 

j~ limb bud and the neural plate that are responsive to Shh. (Marigo et al. (1996) PNAS, in 

pi press; Marigo et al. (1996) Development 122:1225-1233). The GLI genes encode putative 

25 transcription factors having zinc finger DNA binding domains (Orenic et al. (1990) Genes & 
Dev 4:1053-1067; Kinzler et al. (1990) Mol Cell Biol 10:634-642). Transcription of the GLI 
gene has been reported to be upregulated in response to hedgehog in limb buds, while 
transcription of the GLI3 gene is downregulated in response to hedgehog induction (Marigo 
et al. (1996) Development 122:1225-1233). By selecting transcriptional regulatory sequences 
30 from such target genes, e.g. from patched or GLI genes, that are responsible for the up- or 
down regulation of these genes in response to hedgehog induction, and operatively linking 
such promoters to a reporter gene, the present invention provides a transcription based assay 
which is sensitive to the ability of a specific test compound to influence hedgehog signalling 
pathways. Expression of the reporter gene, thus, provides a valuable screening tool for the 
35 development of compounds that act as agonists or antagonists of hedgehog induction. 

Reporter gene based assays of this invention measure the end stage of the above 
described cascade of events, e.g., transcriptional modulation. Accordingly, in practicing one 
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embodiment of the assay, a reporter gene construct is inserted into the reagent cell in order to 
generate a detection signal dependent on hedgehog signaling. To identify potential 
regulatory elements responsive to hedgehog signaling present in the transcriptional regulatory 
sequence of a target gene, nested deletions of genomic clones of the target gene can be 
5 constructed using standard techniques. See, for example, Current Pr otocols in Molecular 
Biology. Ausubel, F.M et aL (eds.) Greene Publishing Associates, (1989); U.S. Patent 
5,266,488; Sato et aL (1995) J Biol Chem 270:10314-10322; and Kube et aL (1995) Cytokine 
7:1-7. A nested set of DNA fragments from the gene's 5-flanking region are placed upstream 
of a reporter gene, such as the luciferase gene, and assayed for their ability to direct reporter 
10 gene expression in patched expressing cells. Host cells transiently transfected with reporter 
gene constructs can be scored for the induction of expression of the reporter gene in the 
presence and absence of hedgehog to determine regulatory sequences which are responsice to 
hedgehog-dependent patched signalling. 

In practicing one embodiment of the assay, a reporter gene construct is inserted into 
OB 15 the reagent cell in order to generate a detection signal dependent on second messengers 
CP generated by induction with hedgehog protein. Typically, the reporter gene construct will 

j« include a reporter gene in operative linkage with one or more transcriptional regulatory 

Ul elements responsive to the hedgehog activity, with the level of expression of the reporter 

>! gene providing the hedgehog-dependent detection signal. The amount of transcription from 

* 20 the reporter gene may be measured using any method known to those of skill in the art to be 
O suitable. For example, mRNA expression from the reporter gene may be detected using 

p RNAse protection or RNA-based PGR, or the protein product of the reporter gene may be 

RJ identified by a characteristic stain or an intrinsic activity. The amount of expression from the 

reporter gene is then compared to the amount of expression in either the same cell in the 
25 absence of the test compound (or hedgehog) or it may be compared with the amount of 
transcription in a substantially identical cell that lacks the target receptor protein. Any 
statistically or otherwise significant difference in the amount of transcription indicates that 
the test compound has in some manner altered the inductive activity of the hedgehog protein. 
As described in further detail below, in preferred embodiments the gene product of 
30 the reporter is detected by an intrinsic activity associated with that product. For instance, the 
reporter gene may encode a gene product that, by enzymatic activity, gives rise to a detection 
signal based on color, fluorescence, or luminescence. In other preferred embodiments, the 
reporter or marker gene provides a selective growth advantage, e.g., the reporter gene may 
enhance cell viability, relieve a cell nutritional requirement, and/or provide resistance to a 
35 drug. 

Preferred reporter genes are those that are readily detectable. The reporter gene may 
also be included in the construct in the form of a fusion gene with a gene that includes 
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desired transcriptional regulatory sequences or exhibits other desirable properties. Examples 
of reporter genes include, but are not limited to CAT (chloramphenicol acetyl transferase) 
(Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other enzyme detection 
systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), Mol. Cell. Biol. 
5 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 4154-4158; 
Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh et al. (1989) 
Eur. J. Biochem. 182: 231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), human 
placental secreted alkaline phosphatase (Cullen and Malim (1992) Methods in Enzymol. 
216:362-368). 

10 Transcriptional control elements which may be included in a reporter gene construct 

include, but are not limited to, promoters, enhancers, and repressor and activator binding 
sites. Suitable transcriptional regulatory elements may be derived from the transcriptional 
regulatory regions of genes whose expression is induced after modulation of a hedgehog 
receptor's signal transduction activity. The characteristics of preferred genes from which the 
§ 15 transcriptional control elements are derived include, but are not hmited to, low or 
undetectable expression in quiescent cells, rapid induction at the transcriptional level within 
J minutes of extracellular simulation, induction that is transient and independent of new protein 

|0 synthesis, subsequent shut-off of transcription requires new protein synthesis, and mKNAs 

transcribed from these genes have a short half-life. It is not necessary for all of these 
20 properties to be present, 
y In yet other embodiments, second messenger generation can be measured directly in 

O the detection step, such as mobilization of intracellular calcium, phospholipid metabolism or 

^ adenylate cyclase activity are quantitated, for instance, the products of phospholipid 

m hydrolysis IP3, DAG or cAMP could be measured For example, recent studies have 

25 implicated protein kinase A (PKA) as a possible component of hedgehog signaling in 
drosophila and vertebrate organisms (Hammerschmidt et al. (1996) Genes & Dev 10:647). 
High PKA activity has been shown to antagonize hedgehog signaling in these systems. 
Although it is unclear whether PKA acts directly downstream or m parallel with hedgehog 
signaling, it is possible that hedgehog signaling occurs via inhibition of PKA activity. Thus, 
30 detection of PKA activity provides a potential readout for the instant assays. 

Certain hedehog receptors may stimulate the activity of phospholipases. Inositol 
lipids can be extracted and analyzed using standard lipid extraction techniques. Water soluble 
derivatives of all three inositol lipids (IP,, IP 2 , IP3) be q™»mtatal using 

radiolabelling techniques or HPLC. 
35 The mobilization of intracellular calcium or the influx of calcium from outside the 

cell may be a response to hedgehog stimulation or lack there of. Calcium flux in the reagent 
cell can be measured using standard techniques. The choice of the appropriate calcium 
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indicator, fluorescent, bioluminescent, metaUochromic, or Ca^-sensitive microelectrodes 
depends on the cell type and the magnitude and time constant of the event under study (Borle 
(1990) Environ Health Perspect 84:45-56). As an exemplary method of Ca 4- * detection, 
cells could be loaded with the Ca^sensitive fluorescent dye ftira-2 or indo-1, using standard 
5 methods, and any change in Ca" 1 " 1 " measured using a fluorometer. 

In certain embodiments of the assay, it may be desirable to screen for changes in 
cellular phosphorylation. As an example, the drosophila gene fused (fu) which encodes a 
serine/threonine kinase has been identified as a potential downstream target in hedgehog 
signaling. (Preat et al., 1990 Nature 347, 87-89; Therond et al. 1993, Meek Dev. 44. 65-80). 
10 The ability of compounds to modulate serine/threonine kinase activation could be screened 
using colony immunoblotting (Lyons and Nelson (1984) Proc. Natl. Acad Sci. USA 81:7426- 
7430) using antibodies against phosphorylated serine or threonine residues. Reagents for 
performing such assays are commercially available, for example, phosphoserine and 

^ phosphothreonine specific antibodies which measure increases in phosphorylation of those 

p 1 5 residues can be purchased from comercial sources. 

?] After identifying certain test compounds as potential modulators of the target 

*P hedgehog receptor activity, the practioner of the subject assay will continue to test the 

jj efficacy and specificity of the selected compounds both in vitro and in vivo. Whether for 

subsequent in vivo testing, or for administration to an animal as an approved drug, agents 
? 20 identified in the subject assay can be formulated in pharmaceutical preparations for in vivo 
%j administration to an animal, preferably a human. 

P . The subject compounds selected in the subject, or a pharmaceutically acceptable salt 

£ thereof, may accordingly be formulated for administration with a biologically acceptable 

V* medium, such as water, buffered saline, polyol (for example, glycerol, propylene glycol, 

25 liquid polyethylene glycol and the like) or suitable mixtures thereof. The optimum 
concentration of the active ingredient(s) in the chosen medium can be determined 
empirically, according to procedures well known to medicinal chemists. As used herein, 
"biologically acceptable medium" includes any and all solvents, dispersion media, and the 
like which may be appropriate for the desired route of administration of the pharmaceutical 
30 preparation. The use of such media for pharmaceutically active substances is known in the 
art. Except insofar as any conventional media or agent is incompatible with the activity of 
the compound, its use in the pharmaceutical preparation of the invention is contemplated. 
Suitable vehicles and their formulation inclusive of other proteins are described, for example, 
in the book Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. 
35 Mack Publishing Company, Easton, Pa., USA 1985). These vehicles include injectable 
"deposit formulations". Based on the above, such pharmaceutical formulations include, 
although not exclusively, solutions or freeze-dried powders of the compound in association 
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with one or more pharmaceutical acceptable vehicles or diluents, and contained in buffered 
media at a suitable pH and isosmotic with physiological fluids. In preferred embodiment, the 
compound can be disposed in a sterile preparation for topical and/or systemic administration. 
In the case of freeze-dried preparations, supporting excipients such as, but not exclusively, 
5 mannitol or glycine may be used and appropriate buffered solutions of the desired volume 
will be provided so as to obtain adequate isotonic buffered solutions of the desired pH. 
Similar solutions may also be used for the pharmaceutical compositions of compounds in 
isotonic solutions of the desired volume and include, but not exclusively, the use of buffered 
saline solutions with phosphate or citrate at suitable concentrations so as to obtain at all times 
1 0 isotonic pharmaceutical preparations of the desired pH, (for example, neutral pH). 

In yet another embodiment, the method of the present invention can be used to isolate 
and clone hedgehog receptors. For example, purified hedgehog proteins of the present 
invention can be employed to precipitate hedgehog receptor proteins from cell fractions 
P prepared from cells which are responsive to a hedgehog protein. For instance, purified 
W 15 hedgehog protein can be derivatized with biotin (using, for instance, NHS-Biotin, Pierce 
J Chemical catalog no. 21420G), and the biotinylated protein utilized to saturate membrane 
£ bound hh receptors. The hedgehog bound receptors can subsequently be adsorbed or 
immobilized on streptavidin. If desired, the hedgehog-receptor complex can be cross-linked 
with a chemical cross-linking agent In such as manner, hh receptors can be purified, 
_ 20 preferably to near homogeneity. The isolated hh receptor can then be partially digested with, 
y for example, trypsin, and the resulting peptides separated by reverse-phase chromatography. 

P The chromatography fragments are then analyzed by Edman degradation to obtain single 

j| sequences for two or more of the proteolytic fragments. From the chemically determined 

(p amino acid sequence for each of these tryptic fragments, a set of oligonucleotide primers can 

25 be designed for PCR. These primers can be used to screen both genomic and cDNA libraries. 
Similar strategies for cloning receptors have been employed, for example, to obtain the 
recombinant gene for somatostatin receptors (Eppler et al. (1992) J Biol Chem 267:15603- 
15612). 

Other techniques for identifying hedgehog receptors by expression cloning will be 
30 evident in light of the present disclosure. For instance, purified hh polypeptides can be 
immobilized in wells of micro titre plates and contacted with, for example, COS cells 
transfected with a cDNA library (e.g., from tissue expected to be responsive to hedgehog 
induction). From this panning assay, cells which express hedgehog receptor molecules can 
be isolated on the basis of binding to the immobilized hedgehog protein. Another cloning 
35 system, described in PCT publications WO 92/06220 of Flanagan and Leder, involves the use 
of an expression cloning system whereby a hedgehog receptor is scored on the basis of 
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binding to a hedgehog/alkaline phosphatase fusion protein (see also Cheng et al. (1994) Cell 
79:157-168) 

Another aspect of the present invention relates to a method of inducing and/or 
maintaining a differentiated state, enhancing survival, and/or promoting proliferation of a cell 
5 responsive to a vertebrate hedgehog protein, by contacting the cells with an hh agonist or an 
hh antagonist as the circumstances may warrant. For instance, it is contemplated by the 
invention that, in light of the present finding of an apparently broad involvement of hedgehog 
proteins in the formation of ordered spatial arrangements of differentiated tissues in 
vertebrates, the subject method could be used to generate and/or maintain an array of 
10 different vertebrate tissue both in vitro and in vivo. The hh agent, whether inductive or anti- 
inductive, can be, as appropriate, any of the preparations described above, including isolated 
polypeptides, gene therapy constructs, antisense molecules, peptidomimetics or agents 
identified in the drug assays provided herein. Moreover, it is contemplated that, based on the 
P observation of activity of the vertebrate hedgehog proteins in drosophila, hh agents, for 
jS 15 purposes of therapeutic and diagnostic uses, can include the Dros-HH protein and homologs 
Fl thereof. Moreover, the source of hedgehog protein can be, in addition to purified protein or 



*p recombinant cells, cells or tissue explants which naturally produce one or more hedgehog 
proteins. For instance, as described in Example 2, neural tube explants from embryos, 
particularly floorplate tissue, can provide a source for Shh polypeptide, which source can be 
20 implanted in a patient or otherwise provided, as appropriate, for induction or maintenance of 



differentiation. 



P 
S! 

pi For example, the present method is applicable to cell culture techniques. In vitro 

>Q neuronal culture systems have proved to be fundamental and indispensable tools for the study 

^ of neural development, as well as the identification of neurotrophic factors such as nerve 

25 growth factor (NGF), ciliary trophic factors (CNTF), and brain derived neurotrophic factor 
(BDNF). Once a neuronal cell has become terminally-differentiated it typically will not 
change to another terminally differentiated cell-type. However, neuronal cells can 
nevertheless readily lose their differentiated state. This is commonly observed when they are 
grown in culture from adult tissue, and when they form a blastema during regeneration. The 
30 present method provides a means for ensuring an adequately restrictive environment in order 
to maintain neuronal cells at various stages of differentiation, and can be employed, for 
instance, in cell cultures designed to test the specific activities of other trophic factors. In 
such embodiments of the subject method, the cultured cells can be contacted with an hh 
polypeptide, or an agent identified in the assays described above, in order to induce neuronal 
35 differentiation (e.g. of a stem cell), or to maintain the integrity of a culture of terminally- 
differentiated neuronal cells by preventing loss of differentiation. The source of hedgehog 
protein in the culture can be derived from, for example, a purified or semi-purified protein 
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composition added directly to the cell culture media, or alternatively, supported and/or 
released from a polymeric device which supports the growth of various neuronal cells and 
which has been doped with the protein. The source of the hedgehog protein can also be a 
cell that is co-cultured with the intended neuronal cell and which produces a recombinant hh. 
5 Alternatively, the source can be the neuronal cell itself which has been engineered to produce 
a recombinant hedgehog protein. In an exemplary embodiment, a naive neuronal ceil (e.g. a 
stem cell) is treated with an hh agonist in order to induce differentiation of the cells into, for 
example, sensory neurons or, alternatively, motorneurons. Such neuronal cultures can be 
used as convenient assay systems as well as sources of implantable cells for therapeutic 
10 treatments. For example, hh polypeptides may be useful in establishing and maintaining the 
olfactory neuron cultures described in U.S. Patent 5,318,907 and the like. 

According to the present invention, large numbers of non-tumorigenic neural 
progenitor cells can be perpetuated in vitro and induced to differentiate by contact with 
P hedgehog proteins. Generally, a method is provided comprising the steps of isolating neural 
CO 15 progenitor cells from an animal, perpetuating these cells in vitro or in vivo, preferably in the 
J? ! presence of growth factors, and differentiating these cells into particular neural phenotypes, 
-p e.g., neurons and glia, by contacting the cells with a hedgehog agonist. 

Q Progenitor cells are thought to be under a tonic inhibitory influence which maintains 

^ the progenitors in a suppressed state until their differentiation is required. However, recent 

p 20 techniques have been provided which permit these cells to be proliferated, and unlike neurons 

which are terminally differentiated and therefore non-dividing, they can be produced in 

unlimited number and are highly suitable for transplantation into heterologous and 

y3 autologous hosts with neurodegenerative diseases. 

01 

By "progenitor" it is meant an oligopotent or multipotent stem cell which is able to 
25 divide without limit and, under specific conditions, can produce daughter cells which 
terminally differentiate such as into neurons and glia. These cells can be used for 
transplantation into a heterologous or autologous host. By heterologous is meant a host other 
than the animal from which the progenitor cells were originally derived. By autologous is 
meant the identical host from which the cells were originally derived. 

30 Cells can be obtained from embryonic, post-natal, juvenile or adult neural tissue from 

any animal. By any animal is meant any multicellular animal which contains nervous tissue. 
More particularly, is meant any fish, reptile, bird, amphibian or mammal and the like. The 
most preferable donors are mammals, especially mice and humans. 

In the case of a heterologous donor animal, the animal may be euthanized, and the 
35 brain and specific area of interest removed using a sterile procedure. Brain areas of particular 
interest include any area from which progenitor cells can be obtained which will serve to 



SI 
P 

ru 



-71- 



restore function to a degenerated area of the host's brain. These regions include areas of the 
central nervous system (CNS) including the cerebral cortex, cerebellum, midbrain, brainstem, 
spinal cord and ventricular tissue, and areas of the peripheral nervous system (PNS) including 
the carotid body and the adrenal medulla. More particularly, these areas include regions in the 
5 basal ganglia, preferably the striatum which consists of the caudate and putamen, or various 
cell groups such as the globus pallidus, the subthalamic nucleus, the nucleus basalis which is 
found to be degenerated in Alzheimer's Disease patients, or the substantia nigra pars 
compacta which is found to be degenerated in Parkinson's Disease patients. 

Human heterologous neural progenitor cells may be derived from fetal tissue obtained 
1 0 from elective abortion, or from a post-natal, juvenile or adult organ donor. Autologous neural 
tissue can be obtained by biopsy, or from patients undergoing neurosurgery in which neural 
tissue is removed, in particular during epilepsy surgery, and more particularly during 
temporal lobectomies and hippocampalectomies. 

O Cells can be obtained from donor tissue by dissociation of individual cells from the 

60 

p 15 connecting extracellular matrix of the tissue. Dissociation can be obtained using any known 
^ procedure, including treatment with enzymes such as trypsin, collagenase and the like, or by 
ih using physical methods of dissociation such as with a blunt instrument. Dissociation of fetal 

P cells can be carried out in tissue culture medium, while a preferable medium for dissociation 

^ of juvenile and adult cells is artificial cerebral spinal fluid (aCSF). Regular aCSF contains 

P 20 124 mM NaCl, 5 mM KC1, L3 mM MgCl2, 2 mM CaCl2, 26 raM NaHC03, and 10 mM D- 
^ glucose. Low Ca 2+ aCSF contains the same ingredients except for MgCl2 at a concentration 

py of 3 .2 mM and CaCl2 at a concentration of 0. 1 mM. 

*0 

|ji Dissociated cells can be placed into any known culture medium capable of supporting 

cell growth, including MEM, DMEM, RPMI, F-12, and the like, containing supplements 

25 which are required for cellular metabolism such as glutamine and other amino acids, 
vitamins, minerals and useful proteins such as transferrin and the like. Medium may also 
contain antibiotics to prevent contamination with yeast, bacteria and fungi such as penicillin, 
streptomycin, gentamicin and the like. In some cases, the medium may contain serum 
derived from bovine, equine, chicken and the like. A particularly preferable medium for cells 

30 is a mixture of DMEM and F-12. 

Conditions for culturing should be close to physiological conditions. The pH of the 
culture media should be close to physiological pH, preferably between pH 6-8, more 
preferably close to pH 7, even more particularly about pH 7.4. Cells should be cultured at a 
temperature close to physiological temperature, preferably between 30°C-40°C, more 
35 preferably between 32°C-38°C, and most preferably between 35°C-37°C. 
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Cells can be grown in suspension or on a fixed substrate, but proliferation of the 
progenitors is preferably done in suspension to generate large numbers of cells by formation 
of "neurospheres" (see, for example, Reynolds et al. (1992) Science 255:1070-1709; and PCT 
Publications WO93/01275, WO94/09119, WO94/10292, and W094/16718). In the case of 
5 propagating (or splitting) suspension cells, flasks are shaken well and the neurospheres 
allowed to settle on the bottom corner of the flask. The spheres are then transferred to a 50 
ml centrifuge tube and centrifuged at low speed. The medium is aspirated, the cells 
resuspended in a small amount of medium with growth factor, and the cells mechanically 
dissociated and resuspended in separate aliquots of media. 

10 Cell suspensions in culture medium are supplemented with any growth factor which 

allows for the proliferation of progenitor cells and seeded in any receptacle capable of 
sustaining cells, though as set out above, preferably in culture flasks or roller bottles. Cells 
typically proliferate within 3-4 days in a 37°C incubator, and proliferation can be reinitiated 

p at any time after that by dissociation of the cells and resuspension in fresh medium containing 

CB 15 growth factors. 

? 

^ In the absence of substrate, cells lift off the floor of the flask and continue to 

yi proliferate in suspension forming a hollow sphere of undifferentiated cells. After 

0 approximately 3-10 days in vitro, the proliferating clusters (neurospheres) are fed every 2-7 
^ days, and more particularly every 2-4 days by gentle centrifugation and resuspension in 
p 20 medium containing growth factor. 

p After 6-7 days in vitro, individual cells in the neurospheres can be separated by 

physical dissociation of the neurospheres with a blunt instrument, more particularly by 

01 triturating the neurospheres with a pipette. Single cells from the dissociated neurospheres are 
suspended in culture medium containing growth factors, and differentiation of the cells can 

25 be induced by plating (or resuspending) the cells in the presence of a hedgehog agonist, and 
(optionally) any other factor capable of sustaining differentiation, such as bFGF and the like. 

To further illustrate other uses of hedgehog agonists and antagonists, it is noted that 
intracerebral grafting has emerged as an additional approach to central nervous system 
therapies. For example, one approach to repairing damaged brain tissues involves the 

30 transplantation of cells from fetal or neonatal animals into the adult brain (Dunnett et al. 
(1987) J Exp Biol 123:265-289; and Freund et al. (1985) J Neurosci 5:603-616). Fetal 
neurons from a variety of brain regions can be successfully incorporated into the adult brain, 
and such grafts can alleviate behavioral defects. For example, movement disorder induced by 
lesions of dopaminergic projections to the basal ganglia can be prevented by grafts of 

35 embryonic dopaminergic neurons. Complex cognitive functions that are impaired after 
lesions of the neocortex can also be partially restored by grafts of embryonic cortical cells. 
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The use of hedgehog proteins or mimetics, such as Shh or Dhh, in the culture can prevent 
loss of differentiation, or where fetal tissue is used, especially neuronal stem cells, can be 
used to induce differentiation. 

Stem cells useful in the present invention are generally known. For example, several 
5 neural crest cells have been identified, some of which are multipotent and likely represent 
uncommitted neural crest cells, and others of which can generate only one type of cell, such 
as sensory neurons, and likely represent committed progenitor cells. The role of hedgehog 
proteins employed in the present method to culture such stem cells can be to induce 
differentiation of the uncommitted progenitor and thereby give rise to a committed progenitor 
10 cell, or to cause further restriction of the developmental fate of a committed progenitor cell 
towards becoming a terminally-differentiated neuronal cell. For example, the present method 
can be used in vitro to induce and/or maintain the differentiation of neural crest cells into 
glial cells, Schwann cells, chromaffin cells, cholinergic sympathetic or parasympathetic 
p neurons, as well as peptidergic and serotonergic neurons. The hedgehog protein can be used 
?S 15 alone, or can be used in combination with other neurotrophic factors which act to more 
f\ particularly enhance a particular differentiation fate of the neuronal progenitor cell. In the 

,p later instance, an hh polypeptide might be viewed as ensuring that the treated cell has 
j*- achieved a particular phenotypic state such that the cell is poised along a certain 

2jj developmental pathway so as to be properly induced upon contact with a secondary 
* 20 neurotrophic factor. In similar fashion, even relatively undifferentiated stem cells or 
primitive neuroblasts can be maintained in culture and caused to differentiate by treatment 
with hedgehog agonists. Exemplary primitive cell cultures comprise cells harvested from the 
neural plate or neural tube of an embryo even before much overt differentiation has occurred. 



m 



In addition to the implantation of cells cultured in the presence of a functional 
25 hedgehog activity and other in vitro uses described above, yet another aspect of the present 
invention concerns the therapeutic application of a hedgehog protein or mimetic to enhance 
survival of neurons and other neuronal cells in both the central nervous system and the 
peripheral nervous system. The ability of hedgehog protein to regulate neuronal 
differentiation during development of the nervous system and also presumably in the adult 
30 state indicates that certain of the hedgehog proteins can be reasonably expected to facilitate 
control of adult neurons with regard to maintenance, functional performance, and aging of 
normal cells; repair and regeneration processes in chemically or mechanically Iesioned cells; 
and prevention of degeneration and premature death which result from loss of differentiation 
in certain pathological conditions. In light of this understanding, the present invention 
35 specifically contemplates applications of the subject method to the treatment of (prevention 
and/or reduction of the severity of) neurological conditions deriving from: (i) acute, 
subacute, or chronic injury to the nervous system, including traumatic injury, chemical 
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injury, vasal injury and deficits (such as the ischemia resulting from stroke), together with 
infectious/inflammatory and tumor-induced injury; (ii) aging of the nervous system 
including Alzheimer's disease; (iii) chronic neurodegenerative diseases of the nervous 
system, including Parkinson's disease, Huntington's chorea, amyotrophic lateral sclerosis and 
5 the like, as well as spinocerebellar degenerations; and (iv) chronic immunological diseases of 
the nervous system or affecting the nervous system, including multiple sclerosis. 

Many neurological disorders are associated with degeneration of discrete populations 
of neuronal elements and may be treatable with a therapeutic regimen which includes a 
hedgehog agonist. For example, Alzheimer's disease is associated with deficits in several 
10 neurotransmitter systems, both those that project to the neocortex and those that reside with 
the cortex. For instance, the nucleus basalis in patients with Alzheimer's disease have been 
observed to have a profound (75%) loss of neurons compared to age-matched controls. 
Although Alzheimer's disease is by far the most common form of dementia, several other 
P disorders can produce dementia. Several of these are degenerative diseases characterized by 
CO 15 the death of neurons in various parts of the central nervous system, especially the cerebral 
ffl cortex. However, some forms of dementia are associated with degeneration of the thalmus or 
«P the white matter underlying the cerebral cortex. Here, the cognitive dysfunction results from 
^ the isolation of cortical areas by the degeneration of efferents and afferents. Huntington's 

disease involves the degeneration of intrastraital and cortical cholinergic neurons and 
* 20 GAB Aergic neurons. Pick's disease is a severe neuronal degeneration in the neocortex of the 
Q frontal and anterior temporal lobes, sometimes accompanied by death of neurons in the 

striatum. Treatment of patients suffering from such degenerative conditions can include the 
application of hedgehog polypeptides, or agents which mimic their effects, in order to 
01 control, for example, differentiation and apoptotic events which give rise to loss of neurons 

25 (e.g. to enhance survival of existing neurons) as well as promote differentiation and 
repopulation by progenitor cells in the area affected. In preferred embodiments, a source of a 
hedgehog agent is stereotactically provided within or proximate the area of degeneration. In 
addition to degenerative-induced dementias, a pharmaceutical preparation of one or more of 
the subject hedgehog proteins can be applied opportunely in the treatment of 
30 neurodegenerative disorders which have manifestations of tremors and involuntary 
movements. Parkinson's disease, for example, primarily affects subcortical structures and is 
characterized by degeneration of the nigrostriatal pathway, raphe nuclei, locus cereleus, and 
the motor nucleus of vagus. Ballism is typically associated with damage to the subthalmic 
nucleus, often due to acute vascular accident. Also included are neurogenic and myopathic 
35 diseases which ultimately affect the somatic division of the peripheral nervous system and are 
manifest as neuromuscular disorders. Examples include chronic atrophies such as 
amyotrophic lateral sclerosis, Guillain-Barre syndrome and chronic peripheral neuropathy, as 
well as other diseases which can be manifest as progressive bulbar palsies or spinal muscular 
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atrophies. The present method is amenable to the treatment of disorders of the cerebellum 
which result in hypotonia or ataxia, such as those lesions in the cerebellum which produce 
disorders in the limbs ipsilateral to the lesion. For instance, a preparation of a hedgehog 
homolog can used to treat a restricted form of cerebellar cortical degeneration involving the 
anterior lobes (vermis and leg areas) such as is common in alcoholic patients. 

In an illustrative embodiment, the subject method is used to treat amyotrophic lateral 
sclerosis. ALS is a name given to a complex of disorders that comprise upper and lower 
motor neurons. Patients may present with progressive spinal muscular atrophy, progressive 
bulbar palsy, primary lateral sclerosis, or a combination of these conditions. The major 
pathological abnormality is characterized by a selective and progressive degeneration of the 
lower motor neurons in the spinal cord and the upper motor neurons in the cerebral cortex. 
The therapeutic application of a hedgehog agonist, particularly Dhh, can be used alone, or in 
conjunction with other neurotrophic factors such as CNTF, BDNF or NGF to prevent and/or 
reverse motor neuron degeneration in ALS patients. 

Hedgehog proteins of the present invention can also be used in the treatment of 
autonomic disorders of the peripheral nervous system, which include disorders affecting the 
innervation of smooth muscle and endocrine tissue (such as glandular tissue). For instance, 
the subject method can be used to treat tachycardia or atrial cardiac arrythmias which may 
arise from a degenerative condition of the nerves innervating the striated muscle of the heart. 

Furthermore, a potential role for certain of the hedgehog proteins, which is apparent 
from the appended examples, mainly the data of respecting hedgehog expression in sensory 
and motor neurons of the head and trunk (including limb buds), concerns the role of 
hedgehog proteins in development and maintenance of dendritic processes of axonal neurons. 
Potential roles for hedgehog proteins consequently include guidance for axonal projections 
and the ability to promote differentiation and/or maintenance of the innervating cells to their 
axonal processes. Accordingly, compositions comprising hedgehog agonists or other 
hedgehog agents described herein, may be employed to support, or alternatively antagonize 
the survival and reprojection of several types of ganglionic neurons sympathetic and sensory 
neurons as well as motor neurons. In particular, such therapeutic compositions may be useful 
in treatments designed to rescue, for example, various neurons from lesion-induced death as 
well as guiding reprojection of these neurons after such damage. Such diseases include, but 
are not limited to, CNS trauma infarction, infection (such as viral infection with varicella- 
zoster), metabolic disease, nutritional deficiency, toxic agents (such as cisplatin treatment). 
Moreover, certain of the hedgehog agents (such as antagonistic form) may be useful in the 
selective ablation of sensory neurons, for example, in the treatment of chronic pain 
syndromes. 
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As appropriate, hedgehog agents can be used in nerve prostheses for the repair of 
central and peripheral nerve damage. In particular, where a crushed or severed axon is 
intubulated by use of a prosthetic device, hedgehog polypeptides can be added to the 
prosthetic device to increase the rate of growth and regeneration of the dendridic processes. 
5 Exemplary nerve guidance channels are described in U.S. patents 5,092,871 and 4,955,892. 
Accordingly, a severed axonal process can be directed toward the nerve ending from which it 
was severed by a prosthesis nerve guide which contains, e.g. a semi-solid formulation 
containing hedgehog polypeptide or mimetic, or which is derivatized along the inner walls 
with a hedgehog protein. 

10 In another embodiment, the subject method can be used in the treatment of neoplastic 

or hyperplastic transformations such as may occur in the central nervous system. For 
instance, certain of the hedgehog proteins (or hh agonists) which induce differentiation of 
neuronal cells can be utilized to cause such transformed cells to become either post-mitotic or 
apoptotic. Treatment with a hedgehog agent may facilitate disruption of autocrine loops, 

15 such as TGF-p or PDGF autostimulatory loops, which are believed to be involved in the 
neoplastic transformation of several neuronal tumors. Hedgehog agonists may, therefore, 
thus be of use in the treatment of, for example, malignant gliomas, medulloblastomas, 
neuroectodermal tumors, and ependymonas. 

Yet another aspect of the present invention concerns the application of the discovery 
20 that hedgehog proteins are morphogenic signals involved in other vertebrate organogenic 
pathways in addition to neuronal differentiation as described above, having apparent roles in 
other endodermal patterning, as well as both mesodermal and endodermal differentiation 
processes. As described in the Examples below, Shh clearly plays a role in proper limb 
growth and patterning by initiating expression of signaling molecules, including Bmp-2 in the 
25 mesoderm and Fgf-4 in the ectoderm. Thus, it is contemplated by the invention that 
compositions comprising hedgehog proteins can also be utilized for both cell culture and 
therapeutic methods involving generation and maintenance of non-neuronal tissue. 

In one embodiment, the present invention makes use of the discovery that hedgehog 
proteins, such as Shh, are apparently involved in controlling the development of stem cells 

30 responsible for formation of the digestive tract, liver, lungs, and other organs which derive 
from the primitive gut. As described in the Examples below, Shh serves as an inductive 
signal from the endoderm to the mesoderm, which is critical to gut morphogenesis. 
Therefore, for example, hedgehog agonists can be employed in the development and 
maintenance of an artificial liver which can have multiple metabolic functions of a normal 

35 liver. In an exemplary embodiment, hedgehog agonists can be used to induce differentiation 
of digestive tube stem cells to form hepatocyte cultures which can be used to populate 
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extracellular matrices, or which can be encapsulated in biocompatible polymers, to form both 
implantable and extracorporeal artificial livers. 

In another embodiment, therapeutic compositions of hedgehog agonists can be 
utilized in conjunction with transplantation of such artificial livers, as well as embryonic liver 
5 structures, to promote intraperitoneal implantation, vascularization, and in vivo differentiation 
and maintenance of the engrafted liver tissue. 

In yet another embodiment, hedgehog agonists can be employed therapeutically to 
regulate such organs after physical, chemical or pathological insult. For instance, therapeutic 
compositions comprising hedgehog agonists can be utilized in liver repair subsequent to a 
10 partial hepatectomy. Similarly, therapeutic compositions containing hedgehog agonists can 
be used to promote regeneration of lung tissue in the treatment of emphysema. 

•* 

*; In still another embodiment of the present invention, compositions comprising 

p hedgehog agonists can be used in the in vitro generation of skeletal tissue, such as from 
W skeletogenic stem cells, as well as the in vivo treatment of skeletal tissue deficiencies. The 
f! 15 present invention particularly contemplates the use of hedgehog agonists which maintain a 
*p. skeletogenic activity, such as an ability to induce chondrogenesis and/or osteogenesis. By 
V I "skeletal tissue deficiency", it is meant a deficiency in bone or other skeletal connective tissue 
yg at any site where it is desired to restore the bone or connective tissue, no matter how the 
* deficiency originated, e.g. whether as a result of surgical intervention, removal of tumor, 

20 ulceration, implant, fracture, or other traumatic or degenerative conditions. 

For instance, the present invention makes available effective therapeutic methods and 
5 compositions for restoring cartilage function to a connective tissue. Such methods are useful 

^ ! in, for example, the repair of defects or lesions in cartilage tissue which is the result of 

degenerative wear such as that which results in arthritis, as well as other mechanical 
25 derangements which may be caused by trauma to the tissue, such as a displacement of torn 
meniscus tissue, meniscectomy, a laxation of a joint by a torn ligament, malignment of joints, 
bone fracture, or by hereditary disease. The present reparative method is also useful for 
remodeling cartilage matrix, such as in plastic or reconstructive surgery, as well as 
periodontal surgery. The present method may also be applied to improving a previous 
30 reparative procedure, for example, following surgical repair of a meniscus, ligament, or 
cartilage. Furthermore, it may prevent the onset or exacerbation of degenerative disease if 
applied early enough after trauma. 

In one embodiment of the present invention, the subject method comprises treating 
the afflicted connective tissue with a therapeutically sufficient amount of a hedgehog agonist, 
35 particularly an Ihh agonist, to generate a cartilage repair response in the connective tissue by 
stimulating the differentiation and/or proliferation of chondrocytes embedded in the tissue. 
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Induction of chondrocytes by treatment with a hedgehog agonist can subsequently result in 
the synthesis of new cartilage matrix by the treated cells. Such connective tissues as articular 
cartilage, interarticular cartilage (menisci), costal cartilage (connecting the true ribs and the 
sternum), ligaments, and tendons are particularly amenable to treatment in reconstructive 

5 and/or regenerative therapies using the subject method. As used herein, regenerative 
therapies include treatment of degenerative states which have progressed to the point of 
which impairment of the tissue is obviously manifest, as well as preventive treatments of 
tissue where degeneration is in its earliest stages or imminent. The subject method can 
further be used to prevent the spread of mineralisation into fibrotic tissue by maintaining a 

1 0 constant production of new cartilage. 

In an illustrative embodiment, the subject method can be used to treat cartilage of a 
diarthroidal joint, such as a knee, an ankle, an elbow, a hip, a wrist, a knuckle of either a 
finger or toe, or a temperomandibular joint. The treatment can be directed to the meniscus of 
p the joint, to the articular cartilage of the joint, or both. To further illustrate, the subject 

15 method can be used to treat a degenerative disorder of a knee, such as which might be the 
If! result of traumatic injury (e.g., a sports injury or excessive wear) or osteoarthritis. An 

J injection of a hedgehog agonist into the joint with, for instance, an arthroscopic needle, can 

be used to treat the afflicted cartilage. In some instances, the injected agent can be in the 
form of a hydrogel or other slow release vehicle described above in order to permit a more 
20 extended and regular contact of the agent with the treated tissue. 

The present invention further contemplates the use of the subject method in the field 
of cartilage transplantation and prosthetic device therapies. To date, the growth of new 
cartilage from either transplantation of autologous or allogenic cartilage has been largely 
^ unsuccessful. Problems arise, for instance, because the characteristics of cartilage and 

25 fibrocartilage varies between different tissue: such as between articular, meniscal cartilage, 
ligaments, and tendons, between the two ends of the same ligament or tendon, and between 
the superficial and deep parts of the tissue. The zonal arrangement of these tissues may 
reflect a gradual change in mechanical properties, and failure occurs when implanted tissue, 
which has not differentiated under those conditions, lacks the ability to appropriately respond. 
30 For instance, when meniscal cartilage is used to repair anterior cruciate ligaments, the tissue 
undergoes a metaplasia to pure fibrous tissue. By promoting chondrogenesis, the subject 
method can be used to particularly addresses this problem, by causing the implanted cells to 
become more adaptive to the new environment and effectively resemble hypertrophic 
chondrocytes of an earlier developmental stage of the tissue. Thus, the action of 
35 chondrogensis in the implanted tissue, as provided by the subject method, and the mechanical 
forces on the actively remodeling tissue can synergize to produce an improved implant more 
suitable for the new function to which it is to be put 
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In similar fashion, the subject method can be applied to enhancing both the generation 
of prosthetic cartilage devices and to their implantation. The need for improved treatment 
has motivated research aimed at creating new cartilage that is based on collagen- 
glycosaminoglycan templates (Stone et al. (1990) Clin Orthop Relat Red 252:129), isolated 
chondrocytes (Grande et al. (1989) J Orthop Res 7:208; and Takigawa et al. (1987) Bone 
Miner 2:449), and chondrocytes attached to natural or synthetic polymers (Walitani et al. 
(1989) J Bone Jt Surg 71B:74; Vacanti et al. (1991) Plast Reconstr Surg 88:753; von 
Schroeder et al. (1991) JBiomed Mater Res 25:329; Freed et al (1993) J Biomed Mater Res 
27:1 1; and the Vacanti et al. U.S. Patent No. 5,041,138). For example, chondrocytes can be 
grown in culture on biodegradable, biocompatible highly porous scaffolds formed from 
polymers such as polyglycolic acid, polylactic acid, agarose gel, or other polymers which 
degrade over time as function of hydrolysis of the polymer backbone into innocuous 
monomers. The matrices are designed to allow adequate nutrient and gas exchange to the 
cells until engraftment occurs. The cells can be cultured in vitro until adequate cell volume 
and density has developed for the cells to be implanted. One advantage of the matrices is that 
they can be cast or molded into a desired shape on an individual basis, so that the final 
product closely resembles the patient's own ear or nose (by way of example), or flexible 
matrices can be used which allow for manipulation at the time of implantation, as in a joint. 

In one embodiment of the subject method, the implants are contacted with a hedgehog 
agonist during the culturing process, such as an Ihh agonist, in order to induce and/or 
maintain differentiated chondrocytes in the culture in order as to further stimulate cartilage 
matrix production within the implant. In such a manner, the cultured cells can be caused to 
maintain a phenotype typical of a chondrogenic cell (i.e. hypertrophic), and hence continue 
the population of the matrix and production of cartilage tissue. 

In another embodiment, the implanted device is treated with a hedgehog agonist in 
order to actively remodel the implanted matrix and to make it more suitable for its intended 
function. As set out above with respect to tissue transplants, the artificial transplants suffer 
from the same deficiency of not being derived in a setting which is comparable to the actual 
mechanical environment in which the matrix is implanted. The activation of the 
chondrocytes in the matrix by the subject method can allow the implant to acquire 
characteristics similar to the tissue for which it is intended to replace. 

In yet another embodiment, the subject method is used to enhance attachment of 
prosthetic devices. To illustrate, the subject method can be used in the implantation of a 
periodontal prosthesis, wherein the treatment of the surrounding connective tissue stimulates 
formation of periodontal ligament about the prosthesis, as well as inhibits formation of 
fibrotic tissue proximate the prosthetic device. 
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In still further embodiments, the subject method can be employed for the generation 
of bone (osteogenesis) at a site in the animal where such skeletal tissue is deficient. Indian 
hedgehog is particularly associated with the hypertrophic chondrocytes that are ultimately 
replaced by osteoblasts. For instance, administration of a hedgehog agent of the present 

5 invention can be employed as part of a method for treating bone loss in a subject, e.g. to 
prevent and/or reverse osteoporosis and other osteopenic disorders, as well as to regulate 
bone growth and maturation. For example, preparations comprising hedgehog agonists can 
be employed, for example, to induce endochondral ossification, at least so far as to facilitate 
the formation of cartilaginous tissue precursors to form the "model" for ossification. 

10 Therapeutic compositions of hedgehog agonists can be supplemented, if required, with other 
osteoinductive factors, such as bone growth factors (e.g. TGF-P factors, such as the bone 
morphogenetic factors BMP-2 and BMP-4, as well as activin), and may also include, or be 
administered in combination with, an inhibitor of bone resorption such as estrogen, 
bisphosphonate, sodium fluoride, calcitonin, or tamoxifen, or related compounds. However, 

15 it will be appreciated that hedgehog proteins, such as Ihh and Shh are likely to be upstream of 
BMPs, e.g. hh treatment will have the advantage of initiating endogenous expression of 
BMPs along with other factors. 

In yet another embodiment of the present invention, a hedgehog antagonist can be 
used to inhibit spermatogenesis. Thus, in light of the present finding that hedgehog proteins 

20 are involved in the differentiation and/or proliferation and maintenance of testicular germ 
cells, hedgehog antagonist can be utilized to block the action of a naturally-occurring 
hedgehog protein. In a preferred embodiment, the hedgehog antagonist inhibits the biological 
activity of Dhh with respect to spermatogenesis, by competitively binding hedgehog 
receptors in the testis. In similar fashion, hedgehog agonists and antagonists are potentially 

25 usefid for modulating normal ovarian function. 

The hedgehog protein, or a pharmaceutical^ acceptable salt thereof, may be 
conveniently formulated for administration with a biologically acceptable medium, such as 
water, buffered saline, polyol (for example, glycerol, propylene glycol, liquid polyethylene 
glycol and the like) or suitable mixtures thereof. The optimum concentration of the active 

30 ingredients) in the chosen medium can be determined empirically, according to procedures 
well known to medicinal chemists. As used herein, "biologically acceptable medium" 
includes any and all solvents, dispersion media, and the like which may be appropriate for the 
desired route of administration of the pharmaceutical preparation. The use of such media for 
pharmaceutically active substances is known in the art. Except insofar as any conventional 

35 media or agent is incompatible with the activity of the hedgehog protein, its use in the 
pharmaceutical preparation of the invention is contemplated. Suitable vehicles and their 
formulation inclusive of other proteins are described, for example, in the book Remington's 
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Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. Mack Publishing Company, 
Easton, Pa., USA 1985). These vehicles include injectable "deposit formulations". Based on 
the above, such pharmaceutical formulations include, although not exclusively, solutions or 
freeze-dried powders of a hedgehog homolog (such as a Shh 9 Dhh or Mhh) in association with 
one or more pharmaceutical^ acceptable vehicles or diluents, and contained in buffered 
media at a suitable pH and isosmotic with physiological fluids. For illustrative purposes only 
and without being limited by the same, possible compositions or formulations which may be 
prepared in the form of solutions for the treatment of nervous system disorders with a 
hedgehog protein are given in U.S. Patent No. 5,218,094. In the case of freeze-dried 
preparations, supporting excipients such as, but not exclusively, mannitol or glycine may be 
used and appropriate buffered solutions of the desired volume will be provided so as to obtain 
adequate isotonic buffered solutions of the desired pH. Similar solutions may also be used 
for the pharmaceutical compositions of hh in isotonic solutions of the desired volume and 
include, but not exclusively, the use of buffered saline solutions with phosphate or citrate at 
suitable concentrations so as to obtain at all times isotonic pharmaceutical preparations of the 
desired pH, (for example, neutral pH). 

Pharmaceutical formulations of the present invention can also include veterinary 
compositions, e.g., pharmaceutical preparations of the hedgehog proteins, or bioactive 
fragments thereof, suitable for veterinary uses, e.g., for the treatment of live stock or domestic 
animals, e.g., dogs. 

Methods of introduction of exogenous hh at the site of treatment include, but are not 
limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral, 
intranasal and topical. In addition, it may be desirable to introduce the pharmaceutical 
compositions of the invention into the central nervous system by any suitable route, including 
intraventricular and intrathecal injection. Intraventricular injection may be facilitated by an 
intraventricular catheter, for example, attached to a reservoir, such as an Ommaya reservoir. 

Methods of introduction may also be provided by rechargeable or biodegradable 
devices. Various slow release polymeric devices have been developed and tested in vivo in 
recent years for the controlled delivery of drugs, including proteinacious biopharmaceuticals. 
A variety of biocompatible polymers (including hydrogels), including both biodegradable and 
non-degradable polymers, can be used to form an implant for the sustained release of an hh at 
a particular target site. Such embodiments of the present invention can be used for the 
delivery of an exogenously purified hedgehog protein, which has been incorporated in the 
polymeric device, or for the delivery of hedgehog produced by a cell encapsulated in the 
polymeric device. 
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An essential feature of certain embodiments of the implant can be the linear release of 
the hh, which can be achieved through the manipulation of the polymer composition and 
form. By choice of monomer composition or polymerization technique, the amount of water, 
porosity and consequent permeability characteristics can be controlled. The selection of the 
shape, size, polymer, and method for implantation can be determined on an individual basis 
according to the disorder to be treated and the individual patient response. The generation of 
such implants is generally known in the art. See, for example, Concise Encylopedia of 
Medical & Dental Materials, ed. by David Williams (MIT Press: Cambridge, MA, 1990); 
and the Sabel et al. U.S. Patent No. 4,883,666. In another embodiment of an implant, a 
source of cells producing a hedgehog protein, or a solution of hydogel matrix containing 
purified hh> is encapsulated in implantable hollow fibers. Such fibers can be pre-spun and 
subsequently loaded with the hedgehog source (Aebischer et al. U.S. Patent No. 4,892,538; 
Aebischer et al. U.S. Patent No. 5,106,627; Hoffinan et al. (1990) Expt. Neurobiol 110:39- 
44; Jaeger et al. (1990) Prog. Brain Res. 82:41-46; and Aebischer et al. (1991) J. Biomech 
Eng. 113:178-183), or can be co-extruded with a polymer which acts to form a polymeric 
coat about the hh source (Lim U.S. Patent No. 4,391,909; Sefton U.S. Patent No. 4,353,888; 
Sugamori et al. (1989) Trans. Am. Artif Intern. Organs 35:791-799; Sefton et al. (1987) 
Biotehnol Bioeng. 29:1135-1143; and Aebischer et al. (1991) Biomaterials 12:50-55). 

In yet another embodiment of the present invention, the pharmaceutical hedgehog 
protein can be administered as part of a combinatorial therapy with other agents. For 
example, the combinatorial therapy can include a hedgehog protein with at least one trophic 
factor. Exemplary trophic factors include nerve growth factor, cilliary neurotrophic growth 
factor, schwanoma-derived growth factor, glial growth factor, stiatal-derived neuronotrophic 
factor, platelet-derived growth factor, and scatter factor (HGF-SF). Antimitogenic agents can 
also be used, for example, when proliferation of surrounding glial cells or astrocytes is 
undesirable in the regeneration of nerve cells. Examples of such antimitotic agents include 
cytosine, arabinoside, 5-fluorouracil, hydroxyurea, and methotrexate. 

Another aspect of the invention features transgenic non-human animals which express 
a heterologous hedgehog gene of the present invention, or which have had one or more 
genomic hedgehog genes disrupted in at least one of the tissue or cell-types of the animal. 
Accordingly, the invention features an animal model for developmental diseases, which 
animal has hedgehog allele which is mis-expressed. For example, a mouse can be bred which 
has one or more hh alleles deleted or otherwise rendered inactive. Such a mouse model can 
then be used to study disorders arising from mis-expressed hedgehog genes, as well as for 
evaluating potential therapies for similar disorders. 

Another aspect of the present invention concerns transgenic animals which are 
comprised of cells (of that animal) which contain a transgene of the present invention and 
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which preferably (though optionally) express an exogenous hedgehog protein in one or more 
cells in the animal. A hedgehog transgene can encode the wild-type form of the protein, or 
can encode homologs thereof, including both agonists and antagonists, as well as antisense 
constructs. In preferred embodiments, the expression of the transgene is restricted to specific 
subsets of cells, tissues or developmental stages utilizing, for example, cis-acting sequences 
that control expression in the desired pattern. In the present invention, such mosaic 
expression of a hedgehog protein can be essential for many forms of lineage analysis and can 
additionally provide a means to assess the effects of, for example, lack of hedgehog 
expression which might grossly alter development in small patches of tissue within an 
otherwise normal embryo. Toward this and, tissue-specific regulatory sequences and 
conditional regulatory sequences can be used to control expression of the transgene in certain 
spatial patterns. Moreover, temporal patterns of expression can be provided by, for example, 
conditional recombination systems or prokaryotic transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be regulated via 
site-specific genetic manipulation in vivo are known to those skilled in the art. For instance, 
genetic systems are available which allow for the regulated expression of a recombinase that 
catalyzes the genetic recombination a target sequence. As used herein, the phrase "target 
sequence" refers to a nucleotide sequence that is genetically recombined by a recombinase. 
The target sequence is flanked by recombinase recognition sequences and is generally either 
excised or inverted in cells expressing recombinase activity. Recombinase catalyzed 
recombination events can be designed such that recombination of the target sequence results 
in . either the activation or repression of expression of one of the subject hedgehog proteins. 
For example, excision of a target sequence which interferes with the expression of a 
recombinant hh gene, such as one which encodes an antagonistic homolog or an antisense 
transcript, can be designed to activate expression of that gene. This interference with 
expression of the protein can result from a variety of mechanisms, such as spatial separation 
of the hh gene from the promoter element or an internal stop codon. Moreover, the transgene 
can be made wherein the coding sequence of the gene is flanked by recombinase recognition 
sequences and is initially transfected into cells in a 3 T to 5 ? orientation with respect to the 
promoter element. In such an instance, inversion of the target sequence will reorient the 
subject gene by placing the 5* end of the coding sequence in an orientation with respect to the 
promoter element which allow for promoter driven transcriptional activation. 

In an illustrative embodiment, either the crelloxP recombinase system of 
bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 
89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in 
vivo site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific 
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recombination of an intervening target sequence located between loxP sequences. loxP 
sequences are 34 base pair nucleotide repeat sequences to which the Cre recombinase binds 
and are required for Cre recombinase mediated genetic recombination. The orientation of 
loxP sequences determines whether the intervening target sequence is excised or inverted 
5 when Cre recombinase is present (Abremski et al. (1984) J. Biol Chem. 259:1509-1514); 
catalyzing the excision of the target sequence when the loxP sequences are oriented as direct 
repeats and catalyzes inversion of the target sequence when loxP sequences are oriented as 
inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on expression 
10 of the Cre recombinase. Expression of the recombinase can be regulated by promoter 
elements which are subject to regulatory control, e.g., tissue-specific, developmental 
stage-specific, inducible or repressible by externally added agents. This regulated control 
will result in genetic recombination of the target sequence only in cells where recombinase 
expression is mediated by the promoter element. Thus, the activation expression of a 
1 5 recombinant hedgehog protein can be regulated via control of recombinase expression. 

Use of the crelloxP recombinase system to regulate expression of a recombinant hh 
protein requires the construction of a transgenic animal containing transgenes encoding both 
the Cre recombinase and the subject protein. Animals containing both the Cre recombinase 
and a recombinant hedgehog gene can be provided through the construction of "double" 
20 transgenic animals. A convenient method for providing such animals is to mate two 
transgenic animals each containing a transgene, e.g., an hh gene and recombinase gene. 

One advantage derived from initially constructing transgenic animals containing a 
hedgehog transgene in a recombinase-mediated expressible format derives from the 
likelihood that the subject protein, whether agonistic or antagonistic, can be deleterious upon 

25 expression in the transgenic animal. In such an instance, a founder population, in which the 
subject transgene is silent in all tissues, can be propagated and maintained. Individuals of 
this founder population can be crossed with animals expressing the recombinase in, for 
example, one or more tissues and/or a desired temporal pattern. Thus, the creation of a 
founder population in which, for example, an antagonistic hh transgene is silent will allow 

30 the study of progeny from that founder in which disruption of hedgehog mediated induction 
in a particular tissue or at certain developmental stages would result in, for example, a lethal 
phenotype. 

Similar conditional transgenes can be provided using prokaryotic promoter sequences 
which require prokaryotic proteins to be simultaneous expressed in order to facilitate 
35 expression of the hedgehog transgene. Exemplary promoters and the corresponding trans- 
activating prokaryotic proteins are given in U.S. Patent No. 4,833,080. 
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Moreover, expression of the conditional transgenes can be induced by gene therapy- 
like methods wherein a gene encoding the trans-activating protein, e.g. a recombinase or a 
prokaiyotic protein, is delivered to the tissue and caused to be expressed, such as in a cell- 
type specific manner. By this method, a hedgehog transgene could remain silent into 
5 adulthood until "turned on" by the introduction of the trans-activator. 

In an exemplary embodiment, the "transgenic non-human animals" of the invention 
are produced by introducing transgenes into the germline of the non-human animal. 
Embryonic target cells at various developmental stages can be used to introduce transgenes. 
Different methods are used depending on the stage of development of the embryonic target 
10 cell. The zygote is the best target for micro-injection. In the mouse, the male pronucleus 
reaches the size of approximately 20 micrometers in diameter which allows reproducible 
injection of l-2pl of DNA solution. The use of zygotes as a target for gene transfer has a 
major advantage in that in most cases the injected DNA will be incorporated into the host 
p gene before the first cleavage (Brinster et al. (1985) PNAS 82:4438-4442). As a consequence, 

^ 15 all cells of the transgenic non-human animal will carry the incorporated transgene. This will 
in general also be reflected in the efficient transmission of the transgene to offspring of the 
founder since 50% of the germ cells will harbor the transgene. Microinjection of zygotes is 

j£j the preferred method for incorporating transgenes in practicing the invention. 

Q 

fe Retroviral infection can also be used to introduce hedgehog transgenes into a non- 

p 20 human animal. The developing non-human embryo can be cultured in vitro to the blastocyst 
'M stage. During this time, the blastomeres can be targets for retroviral infection (Jaenich, R. 

jS (1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic 

^0 treatment to remove the zona pellucida (Manipulating the Mouse Embryo, Hogan eds. (Cold 

^ Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector system used to 

25 introduce the transgene is typically a replication-defective retrovirus carrying the transgene 
(Jahner et al. (1985) PNAS 82:6927-693 1; Van der Putten et al. (1985) PNAS 82:6148-6152). 
Transfection is easily and efficiently obtained by culturing the blastomeres on a monolayer of 
virus-producing cells (Van der Putten, supra; Stewart et al. (1987) EMBO J. 6:383-388). 
Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can 
30 be injected into the blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the 
founders will be mosaic for the transgene since incorporation occurs only in a subset of the 
cells which formed the transgenic non-human animal. Further, the founder may contain 
various retroviral insertions of the transgene at different positions in the genome which 
generally will segregate in the offspring. In addition, it is also possible to introduce 
35 transgenes into the germ line by intrauterine retroviral infection of the midgestation embryo 
(Jahner et al. (1982) supra). 
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A third type of target cell for transgene introduction is the embryonic stem cell (ES). 
ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255-258; 
Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 322:445-448). 
Transgenes can be efficiently introduced into the ES cells by DNA transfection or by 
retrovirus-mediated transduction. Such transformed ES cells can thereafter be combined with 
blastocysts from a non-human animal. The ES cells thereafter colonize the embryo and 
contribute to the germ line of the resulting chimeric animal. For review see Jaenisch,.R. 
(1988) Science 240:1468-1474. 

Methods of making hedgehog knock-out or disruption transgenic animals are also 
generally known. See, for example, Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent knockouts can 
also be generated, e.g. by homologous recombination to insert recombinase target sequences 
flanking portions of an endogenous hh gene, such that tissue specific and/or temporal control 
of inactivation of a hedgehog allele can be controlled as above. 

Exemplification 

The invention, now being generally described, will be more readily understood by 
reference to the following examples, which are included merely for purposes of illustration of 
certain aspects and embodiments of the present invention and are not intended to limit the 
invention. 

Example 1 

Cloning and Expression of Chick Sonic Hedgehog 
(i) Experimental Procedures 

Using degenerate PCR primers, vHHSO (SEQ ID No:18), vHH3Q (SEQ ID No:19) 
and vHH3I (SEQ ID No:20) corresponding to a sequence conserved between Drosophila 
hedgehog (SEQ ID No:34) (Lee, J.J. et al. (1992) Cell 71: 33-50; Mohler, J. et al., (1992) 
Development 115: 957-971) and mouse Indian hedgehog (Ihh) (SEQ ID No: 10), a 220 base 
pair (bp) fragment was amplified from chicken genomic DNA. From 15 isolates, two distinct 
sequences were cloned, pCHA (SEQ ID No:35) and pCHB (SEQ ID No:36), each highly 
homologous to mouse Ihh (Figure 1). A probe made from isolate pCHA did not detect 
expression in embryonic tissues. Isolate pCHB, however, detected a 4 kb message in RNA 
prepared from embryonic head, trunk, or limb bud RNA. This cloned PCR fragment was 
therefore used as a probe to screen an unamplified cDNA library prepared from Hamburger 
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Hamilton stage 22 (Hamburger, W. et al., (1951) J. Morpk 88: 49-92) limb bud RNA as 
described below. 

A single 1.6 kilobase (kb) cDNA clone, pHH-2, was selected for characterization and 
was used in all subsequent analyses. The gene encoding for this cDNA was named Sonic 
5 Hedgehog (after the Sega computer game cartoon character). Sequencing of the entire cDNA 
confirmed the presence of a single long open reading frame potentially encoding for a protein 
of 425 amino acids (aa). The clone extends 220 bp upstream of the predicted initiator 
methionine and approximately 70 bp beyond the stop codon. No consensus polyadenylation 
signal could be identified in the 3' untranslated region. A second potential initiator 

10 methionine occurs at amino acid residue 4. The putative translation initiation signals 
surrounding both methionines are predicted to be equally efficient (Kozak, M, (1987) Nuc. 
Acids Res. 15: 8125-8132). When the pHH-2 Sonic cDNA is used to probe a northern blot of 
stage 24 embryonic chick RNA, a single mRNA species of approximately 4 kb is detected in 
both limb and trunk tissue. The message size was predicted by comparing it to the position 

15 of 18S and 28S ribosomal RNA. Hybridized mRNA was visualized after a two day exposure 
to a phosphoscreen. Because the Sonic cDNA clone pHH-2 is only 1.6 kb, it is likely to be 
missing approximately 2.4 kb of untranslated sequence. 

PCR Cloning 

All standard cloning techniques were performed according to Ausubel et. al. (1989), 
20 and all enzymes were obtained from Boehringer Mannheim Biochemicals. Degenerate 
oligonucleotides corresponding to amino acid residues 161 to 237 of the Drosophila 
hedgehog protein (SEQ ID No:34) (Lee, J.J. et. al., (1992) Cell 71: 33-50) were synthesized. 
These degenerate oligonucleotides, vHHSO (SEQ ID No: 18), vHH30 (SEQ ID No: 19), and 
vHH3I (SEQ ID No:20) also contained Eco RI, Cla I, and Xba I sites, respectively, on their 5' 
25 ends to facilitate subcloning. The nucleotide sequence of these oligos is given below: 

vHH50: 5-GGAATrCCCAG(CA)GITG(CT)AA(AG)GA(AG)(CA)(AG)I(GCT)IAA-3 t 
vHTOO: 5'-TCATCGATGGACCCA(GA)TC(GA)AAICCIGC(TC)TC-3' 
vHH3I: 5 , -GCTCTAGAGCTCIACIGCIA(GA)IC(GT)IGC-3 f 

30 where I represents inosine. Nested PCR was performed by first amplifying chicken genomic 
DNA using the vHHSO and vHH30 primer pair and then further amplifying that product 
using the vHHSO and vHH3I primer pair. In each case the reaction conditions were: initial 
denaturation at 93° C for 2.5 min., followed by 30 cycles of 94° C for 45 s, 50° C for 1 min., 
72° C for 1, and a final incubation of 72° C for 5 min. The 220 bp PCR product was 

35 subcloned into pGEM7zf (Promega). Two unique clones, pCHA (SEQ ID No:35) and pCHB 
(SEQ ID No:36) were identified. 



-88- 

DNA Sequence Analysis 

Nucleotide sequences were determined by the dideoxy chain termination method 
(Sanger, F. et al., (1977) Proc. Natl Acad ScL USA 74: 5463-5467) using Sequenase v2.0 T7 
DNA polymerase (US Biochemicals). 5* and 3* nested deletions of pHH-2 were generated by 
using the nucleases Exo III and SI (Erase a Base, Promega) and individual subclones 
sequenced. DNA and amino acid sequences were analyzed using both GCG (Devereux, J. et 
al., (1984) Nuc. Acids Res. 12: 387-394) and DNAstar software. Searches for related 
sequences were done through the BLAST network service (Altschul, S.R et al., (1990) J. 
Mol Biol 215: 403-410) provided by the National Center for Biotechnology Information. 



Southern Blot Analysis 

Five (5) ^ig of chick genomic DNA was digested with Eco RI and/or Bam HI, 
fractionated on a 1% agarose gel, and transferred to a nylon membrane (Genescreen, New 
England Nuclear). The filters were probed with 32 P-labeled hM or hhb at 42°C in 
hybridization buffer (0.5% BSA, 500 mM NaHP0 4 , 7% SDS, 1 mM EDTA, pH 7.2; Church, 
G.M et al., (1984) Proc, Natl Acad. ScL USA 81: 1991-1995). The blots were washed at 63° 
C once in 0.5% bovine serum albumin, 50 mM NaHP04 (pH 7.2), 5% SDS, 1 mM EDTA 
and twice in 40 mM NaHP04 (pH 7.2), 1% SDS, ImM EDTA, and visualized on Kodak 
XAR-5 film. 

Isolation Of Chicken Sonic cDNA Clones 

A stage 22 limb bud cDNA library was constructed in XgtlO using Eco RI/NotI 
linkers. Unamplified phage plaques (10 6 ) were transferred to nylon filters (Colony/Plaque 
screen, NEN) and screened with a 32 P-labelled pooled inserts from PCR clones pCHA (SEQ 
ID No:35) and pCHB (SEQ ID No:36). Hybridization was performed at 42° C in 50% 
formamide 2X SSC, 10% dextran sulfate, 1% SDS and washing as described in the Southern 
Blot procedure. Eight positive plaques were identified, purified and their cDNA inserts 
excised with EcoRI and subcloned into pBluescript SK> (Stratagene). All eight had 
approximately 1.7 kb inserts with identical restriction patterns. One, pHH-2, was chosen for 
sequencing and used in all further manipulations. 

Preparation Of Digoxigenin-Labeled Riboprobes 

Plasmid pHH-2 was linearized with Hind III and transcribed with T3 RNA 
polymerase (for antisense probes) or with Bam HI and transcribed with T7 RNA polymerase 
according to the manufacturers instructions for the preparation of non-radioactive 
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digoxigenin transcripts. Following the transcription reaction, RNA was precipitated, and 
resuspended in RNAse-free water. 

Whole Mount In Situ Hybridization 

Whole-mount in situ hybridization was performed using protocols modified from 
Parr, B.A. et al.. (1993) Development 119: 247-261; Sasaki, H. et al. (1993) Development 
118: 47-59; Rosen, B. et al. (1993) Trends Genet 9: 162-167. Embryos from incubated 
fertile White Leghorn eggs (Spafas) were removed from the egg and extra-embryonic 
membranes dissected in calcium/magnesium-free phosphate-buffered saline (PBS) at room 
temperature. Unless otherwise noted, all washes are for five minutes at room temperature. 
Embryos were fixed overnight at 4°C with 4% paraformaldehyde in PBS, washed twice with 
PBT (PBS with 0.1% Tween-20) at 4°C, and dehydrated through an ascending methanol 
series in PBT (25%, 50%, 75%, 2 X 100% methanol). Embryos were stored at -20°C until 
further use. 

Both pre-limb bud and limb bud stage embryos were rehydrated through an 
descending methanol series followed by two washes in PBT. Limb bud stage embryos were 
bleached in 6% hydrogen peroxide in PBT, washed three times with PBT, permeabilized with 
proteinase K (Boehringer, 2 ^ig/ml) for 15 minutes, washed with 2 mg/ml glycine in PBT for 
10 minutes, and twice with PBT. Pre-limb bud stage embryos were permealibized (without 
prior incubation with hydrogen peroxide) by three 30 minute washes in RIPA buffer (150 
mM NaCl, 1% NP-40, 0.5% Deoxycholate, 0.1% SDS, ImM EDTA, 50 mM Tris-HCl, pH 
8.0). In all subsequent steps, pre-limb bud and limb bud stage embryos were treated 
equivalently. Embryos were fixed with 4% paraformaldehyde/0.2% gluteraldehyde in PBT, 
washed four times with PBT, once with pre-hybridization buffer (50% formamide, 5 X SSC, 
1% SDS, 50 fig/ml total yeast RNA, 50 ng/ml heparin, pH 4.5), and incubated with fresh pre- 
hybridization buffer for one hour at 70°C. The pre-hybridization buffer was then replaced 
with hybridization buffer (pre-hybridization buffer with digoxigenin labeled riboprobe at 1 
jig/ml) and incubated overnight at 70°C. 

Following hybridization, embryos were washed 3 X 30 minutes at 70°C with solution 
1 (50% formamide, 5 X SSC, 1% SDS, pH 4.5), 3 X 30 minutes at 70°C with solution 3 
30 (50% formamide, 2 X SSC, pH 4.5), and three times at room temperature with TBS (Tris- 
buffered saline with 2 mM levamisole) containing 0.1% Tween-20. Non-specific binding of 
antibody was prevented by preblocking embryos in TBS/0.1% Tween-20 containing 10% 
heat-inactivated sheep serum for 2.5 hours at room temperature and by pre-incubating anti- 
digoxigenin Fab alkaline-phosphatase conjugate (Boehringer) in TBS/0.1% Tween-20 
35 containing heat inactivated 1% sheep serum and approximately 0.3% heat inactivated chick 
embryo powder. After an overnight incubation at 4°C with the pre-adsorbed antibody in 
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TBS/0.1% Tween-20 containing 1% sheep serum, embryos were washed 3X5 minutes at 
room temperature with TBS/0.1% Tween-20, 5 X 1.5 hour room temperature washes with 
TBS/1% Tween-20, and overnight with TBS/1% Tween-20 at 4°C. The buffer was 
exchanged by washing 3 X 10 minutes with NTMT (lOOmM NaCl, 100 mM Tris-HCl, 50 
mM MgC12, 0.1% Tween-20, 2 mM levamisole). The antibody detection reaction was 
performed by incubating embryos with detection solution (NTMT with 0.25 mg/ml NBT and 
0.13 mg/ml X-Phos). In general, pre-limb bud stage embryos were incubated for 5-15 hours 
and limb bud stage embryos 1-5 hours. After the detection reaction was deemed complete, 
embryos were washed twice with NTMT, once with PBT (pH 5.5), postfixed with 4% 
paraformaldehyde/0.1% gluteraldehyde in PBT, and washed several times with PBT. In 
some cases embryos were cleared through a series of 30%, 50%, 70%, and 80% glycerol in 
PBT. Whole embryos were photographed under transmitted light using a Nikon zoom stereo 
microscope with Kodak Ektar 100 ASA film. Selected embryos were processed for frozen 
sections by dehydration in 30% sucrose in PBS followed by embedding in gelatin and 
freezing. 25 jim cryostat sections were collected on superfrost plus slides (Fisher), 
rehydrated in PBS, and mounted with gelvatol. Sections were photographed with Nomarski 
optics using a Zeiss Axiophot microscope and Kodak Ektar 25 ASA film. 

(ii) Sequence Homolgy Comparison Between Chicken Sonic hh And Drosophila hh And 
Other Vertebrate Sonic hh Proteins 

The deduced Sonic amino acid sequence (SEQ ID No:8) is shown and compared to 
the Drosophila hedgehog protein (SEQ ID No:34) in Figure 2. Over the entire open reading 
frame the two proteins are 48% homologous at the amino acids level. The predicted 
Drosophila protein extends 62 aa beyond that of Sonic at its amino terminus. This N- 
terminal extension precedes the putative signal peptide (residues 1-26) of the fly protein 
(SEQ ID No:34), and has been postulated to be removed during processing of the secreted 
form of Drosophila hedgehog (Lee, JJ. et al., (1992) Cell 71: 33-50). The sequence of 
residues 1-26 of the Sonic protein (SEQ ID No:8) matches well with consensus sequences for 
eukaiyotic signal peptides (Landry, S.J. et al., (1993) Trends, Biochem, Set 16: 159-163) and 
is therefore likely to serve that function for Sonic. Furthermore, Figure 3 shows a hydropathy 
plot (Kyte, J. et al., (1982) J. Mol Biol.157: 133-148) indicating that residues 1-26 of the 
Sonic protein (SEQ ID No:8) exhibit a high hydrophobic moment in accord with identified 
eukaryotic signal peptides. Cleavage of the putative signal sequence should occur C-terminal 
to residue 26 according to the predictive method of von Henjie, G. (1986) NucL Acid Res. 
1 1 : 1986. A single potential N-linked glycosylation site is located at amino acid residue 282 
of the Sonic protein (SEQ ID No:8). The predicted Sonic protein does not contain any other 
strong consensus motifs, and is not homologous to any other proteins outside of the 
Hedgehog family. 
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The mouse (SEQ ID No:l 1) and zebrafish (SEQ ID No: 12) homologs of Sonic have 
also been isolated. A comparison of these and the Drosophila sequence is shown 
schematically in Figure 4. All of the vertebrate proteins have a similar predicted structure: a 
putative signal peptide at their amino terminus, followed by an extraordinarily similar 182 
amino acid region (99% identity in chicken versus mouse and 95% identity in chicken versus 
zebrafish) and a less well conserved carboxy-terminal region. 

(Hi) At Least Three Hedgehog Homologies Are Present In The Chicken Genome 

Since two distinct PCR products encoding for chicken hedgehogs were amplified 
from genomic DNA, the total number of genes in the chicken hedgehog family needed to be 
estimated. The two PCR clones pCHA (SEQ ID No:35) and pCHB (SEQ ID No:36) were 
used to probe a genomic Southern blot under moderately stringent conditions as described in 
the above Experimental Procedures. The blot was generated by digesting 5 jxg of chick 
chromosomal DNA with EcoRI and BamHI alone and together. Each probe reacted most 
strongly with a distinct restriction fragment. For example, the blot probed with pCHA, 
shows three bands in each of the Bam HI lanes, one strong at 6.6 kb and two weak at 3.4 and 
2.7 kb. The blot probed with pCHB, shows the 2.7 kb band as the most intense, while the 3.4 
and 6.6 kb bands are weaker. A similar variation of intensities can also be seen in the Bam 
HI/Eco RI and EcoRI lanes. Exposure times were 72 hr. This data indicates that each probe 
recognizes a distinct chicken hedgehog gene, and that a third as yet uncharacterized chicken 
hedgehog homolog exists in the chicken genome. 

(iv) Northern Analysis Defining Sites Of Sonic Transcription 

Northern analysis was performed which confirmed that Sonic is expressed during 
chick development. The spatial and temporal expression of Sonic in the chick embryo from 
gastrulation to early organogenesis was determined by whole mount in situ hybridization 
using a riboprobe corresponding to the full-length Sonic cDNA (SEQ ID No:l). 

20ng total RNA isolated from stage 24 chick leg buds or bodies (without heads or 
limbs) was fractionated on a 0.8% agarose formaldehyde gel and transferred to a nylon 
membrane (Hybond N, Amersham). The blot was probed with the 1 .6 kb EcoRI insert from 
pHH-2. Random-primed a 32 P-labelled insert was hybridized at 42°C hybridization buffer 
(1% BSA, 500mM NaHP0 4 , 7% SDS, 1 mM EDTA, pH 7.2) and washed at 63° C once in 
0.5% bovine serum albumin, 50 mM NaHP04 (pH 7.2), 5% SDS, 1 mM EDTA and once in 
40 mM NaHP04 (pH 7.2), 1% SDS, ImM EDTA. The image was visualized using a 
phosphoimager (Molecular Dynamics) and photographed directly from the video monitor. 

(v) Expression Of Sonic During Mid-Gastrulation 
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Sonic message is detected in the gastrulating blastoderm at early stage 4, the earliest 
stage analyzed. Staining is localized to the anterior end of the primitive streak in a region 
corresponding to Hensen's node. As gastrulation proceeds, the primitive streak elongates to 
its maximal cranial-caudal extent, after which Hensen ! s node regresses caudally and the 
primitive streak shortens. At an early point of node regression, Sonic mRNA can be detected 
at the node and in midline cells anterior to the node. By late stage 5, when the node has 
migrated approximately one-third of the length of the fully elongated primitive streak, 
prominent Sonic expression is seen at the node and in the midline of the embryo, reaching its 
anterior limit at the developing head process. Sections at a cranial level show that Sonic 
mRNA is confined to invaginated axial mesendoderm, tissue which contributes to foregut 
and notochord. More caudally, but still anterior to Hensen's node, staining of axial 
mesoderm is absent and Sonic expression is confined to the epiblast. At the node itself, high 
levels of Sonic message are observed in an asymmetric distribution extending to the left of 
and posterior to the primitive pit This asymmetric distribution is consistently observed (6/6 
embryos from stages 5-7) and is always located to the left of the primitive pit. At the node, 
and just posterior to the node, Sonic expression is restricted to the epiblast and is not 
observed in either mesoderm or endodenn. The expression of Sonic in the dorsal epiblast 
layer without expression in underlying axial mesoderm contrasts markedly with later stages 
where Sonic expression in underlying mesoderm always precedes midline neural tube 
expression. 

(vi) Expression Of Sonic During Head Fold Stages 

During the formation and differentiation of the head process, Sonic mRNA is detected 
in midline cells of the neural tube, the foregut, and throughout most of the axial mesoderm. 
At stage 7, Sonic message is readily detected asymmetrically at the node and in ventral 
midline cells anterior to the node. The rostral limit of Sonic expression extends to the 
anterior-most portions of the embryo where it is expressed in the foregut and prechordal 
mesoderm (Adelmann, H.B., (1932) Am. J. Anat 31, 55-101). At stage 8, expression of 
Sonic persists along the entire ventral midline anterior to Hensen's node, while the node 
region itself no longer expresses Sonic. Transverse sections at different axial levels reveal 
that at stage 8 Sonic is coexpressed in the notochord and the overlying ventromedial 
neuroectoderm from anterior to Hensen's node to the posterior foregut. The levels of Sonic 
message are not uniform in the neural tube: highest levels are found at the presumptive mid- 
and hindbrain regions with progressively lower levels anterior and posterior. The increasing 
graded expression in the neural tube from Hensen's node to the rostral brain may reflect the 
developmental age of the neuroectoderm as differentiation proceeds from posterior to 
anterior. At the anterior-most end of the embryo, expression is observed in midline cells of 
the dorsal and ventral foregut as well as in prechordal mesoderm. Although the prechordal 
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mesoderm is in intimate contact with the overlying ectoderm, the latter is devoid of Sonic 
expression. 

(vii) Expression Of Sonic Daring Early CNS Differentiation 

At stages 10 through 14, Sonic expression is detected in the notochord, ventral neural 
tube (including the floor plate), and gut precursors. By stage 10, there is a marked expansion 
of the cephalic neuroectoderm, giving rise to the fore- mid- and hind-brain. At stage 10, 
Sonic mRNA is abundantly expressed in the ventral midline of the hindbrain and posterior 
midbrain. This expression expands laterally in the anterior midbrain and posterior forebrain. 
Expression does not extend to the rostral forebrain at this or later stages. Sections reveal that 
Sonic is expressed in the notochord, the prechordal mesoderm, and the anterior midline of the 
foregut Expression in the neuroepithelium extends from the forebrain caudally. In the 
posterior-most regions of the embryo which express Sonic, staining is found only in the 
notochord and not in the overlying neurectoderm. This contrasts with earlier expression in 
which the posterior domains of Sonic expression contain cells are located in the dorsal 
epiblast, but not in underlying mesoderm or endoderm. Midgut precursors at the level of the 
anterior intestinal portal also show weak Sonic expression. 

At stage 14, expression continues in all three germ layers. The epithelium of the 
closing midgut expresses Sonic along with portions of the pharyngeal endoderm and anterior 
foregut Ectoderm lateral and posterior to the tail bud also exhibits weak expression. At this 
stage, Sonic is also expressed along entire length of the notochord which now extends 
rostrally only to the midbrain region and no longer contacts the neuroepithelium at the 
anterior end of the embryo. Expression in head mesenchyme anterior to the notochord is no 
longer observed. In the neural tube Sonic is found along the ventral midline of the fore- mid- 
and hindbrain and posteriorly in the spinal cord. In the forebrain, expression is expanded 
laterally relative to the hindbrain. At midgut levels, expression of Sonic in the neural tube 
appears to extend beyond the floor plate into more lateral regions. As observed at stage 10, 
Sonic at stage 14 is found in the notochord, but not in the ventral neural tube in posterior- 
most regions of the embryo. When neuroectodermal expression is first observed in the 
posterior embryo, it is located in midline cells which appear to be in contact with the 
notochord. At later stages, expression continues in areas which show expression at stage 14, 
namely the CNS, gut epithelium including the allantoic stalk, and axial mesoderm. 

(viii) Sonic Is Expressed In Posterior Limb Bud Mesenchyme 

The limb buds initially form as local thickenings of the lateral plate mesoderm. As 
distal outgrowth occurs during stage 17, Sonic expression becomes apparent in posterior 
regions of both the forelimb and the hindlimb. Sections through a stage 21 embryo at the 
level of the forelimbs reveal that expression of Sonic in limb buds is limited to mesenchymal 
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tissue. A more detailed expression profile of Sonic during limb development is discussed 
below in Example 3. Briefly, as the limb bud grows out, expression of 'Sonic narrows along 
the anterior-posterior axis to become a thin stripe along the posterior margin closely apposed 
to the ectoderm. Expression is not found at more proximal regions of the bud. High levels of 
Sonic expression are maintained until around stage 25/26 when staining becomes weaker. 
Expression of Sonic is no longer observed in wing buds or leg buds after stage 28. 

Example 2 

Mouse Sonic Hedgehog Is Implicated in the Regulation ofCNS and Limb Polarity 

(i) Experimental Procedures 

Isolation Of Hedgehog Phage Clones 

The initial screen for mammalian hh genes was performed, as above, using a 700bp 
PCR fragment encompassing exons 1 and 2 of the Drosophila hh gene. Approximately one 
million plaques of a 129/Sv Lambda Fix II genomic library (Stratagene) were hybridized 
with an a 32 P-dATP labeled probe at low stringency ( 55°C in 6xSSC, 0.5%SDS, 5 x 
Denhardt's; final wash at 60°C in 0.5 x SSC, 0.1% SDS for 20'). Five cross hybridizing 
phage plaques corresponding to the Dhh gene were purified. Restriction enzyme analysis 
indicated that all clones were overlapping. Selected restriction enzyme digests were then 
performed to map and subclone one of these. Subclones in pGEM (Promega) or Bluescript 
(Stratagene) which cross-hybridized with the Drosophila hh fragment where sequenced using 
an ABI automatic DNA sequencer. 

Mouse Ihh and Shh were identified by low stringency hybridization (as described 
above) with a chick Shh cDNA clone to one million plaques of an 8.5 day JigtlO mouse 
embryo cDNA library (Fahrner, K. et al., (1987) EMBO J. 6: 1265-1271). Phage plaques 
containing a 1.8kb Ihh and 0.64 and 2.8kb Shh inserts were identified. Inserts were excised 
and subcloned into Bluescript (Stratagene) for dideoxy chain termination sequencing using 
modified T7 DNA polymerase (USB). The larger Shh clone contained a partially processed 
cDNA in which intron splicing at the exon 1/2 junction had not occurred. 

To screen for additional Ihh and Shh cDNA clones, an 8.5 day AZAPII cDNA library 
was probed at high stringency (at 65°C in 6xSSC 9 0.5% SDS, 5 x Denhardt's; final wash at 65 
°C in O.lxSSC, 0.1% SDS for 30') with the Ihh and Shh mouse cDNA clones. No additional 
Ihh clones were identified. However several 2.6kb, apparently full length, Shh clones were 
isolated. The DNA sequence of the additional 5 r coding region not present in the original 
0.64 and 2.8kb Shh clones was obtained by analysis of one of the 2.6kb inserts. 

Northern Blot Analysis 
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Expression of Shh was investigated by RNA blot analysis using 20 ^ig of total RNA 
from adult brain, spleen, kidney, liver, lung, 16.5dpc brain, liver and lung; 9.5dpc to 17.5dpc 
whole embryo; 9.5dpc forebrain, midbrain and 10.5dpc brain. RNA samples were 
electrophoretically separated on a 12% agarose gel, transferred and u.v. crosslinked to 
Genescreen (DuPont) and probed with 2X10 6 cpm/ml of an a 32 P-dATP labeled mouse Shh 
probe (2.8kb insert from Xgt 10 screen). Hybridization was performed at 42°C in 50% 
formamide 5x Denhardt's, SxSSPE, 0.1%SDS, 6.5% dextran, 200ng/ml salmon sperm DNA. 
Final wash was at 55°C in O.txSSC, 0.1%SDS. The blot was exposed for 6 days in the 
presence of an intensifying screen. 

In Situ Hybridization, $-Galactosidase Staining And Histological Analysis 

Embryos from 7.25 to 14.5dpc were analyzed for either Shh or HNF-3p expression by 
whole mount in situ hybridization to digoxygenin labeled RNA probes as described in 
Wilkinson, (1992) In situ Hybridization: A Practical Approach. Oxford; Parr et al., (1993) 
Development 119:247-261. The mouse Shh probe was either a 2.8kb or 0.6kb RNA 
transcript generated by T7 (2.8kb) or T3 (0.6kb) transcription of Xbal and HindlH digests of 
Bluescript (Stratagene) subclones of the original Shh cDNA inserts. The HNF-30 probe was 
generated by HindQI linearization of a HNF-3P cDNA clone (Sasaki, H. et al., (1993) 
Development 118: 47-59) and T7 polymerase transcription of 1.6kb transcript. Embryos 
were photographed on an Olympus-SZH photomicroscope using Kodak Ektachrome EPY 
64T color slide film. 

Sections through wild type and WEXP2-GSM transgenic embryos were prepared and 
hybridized with 35S-UIP labeled RNA probes (Wilkinson, D.G. et al., (1987) Development 
99: 493-500). Sections were photographed as described in McMahon, A.P. et al., (1992) Cell 
69: 581-595. 

p Staining of WEXP2-lacZ embryos with pwas performed according to Whiting, J. et 
al., (1991) Genes & Dev. 5: 2048-2059. General histological analysis of wildtype and 
WEXP2-CShh transgenic embryos was performed on paraffin sections of Bouin's fixed 
embryos counterstained with haematoxylin and eosin. Histological procedures were as 
described by Kaufman, M.H. (1992) The Atlas of Mouse Development, London: Academic 
Press. Sections were photographed on a Leitz Aristoplan compound microscope using 
Kodak EPY 64T color slide film. 

DNA Constructs For Transgenics 

Genomic Wnt-l fragments were obtained by screening a AX3EM12 (Promega) 129/Sv 
mouse genomic library with a 375 bp Mlul-BgFLl fragment derived from the fourth exon of 
the murine Wnt-l gene. One of the clones (Wl-15.1) was used in this study. 
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As an initial step towards the generation of the pWEXP2 expression vector, Wl-15.1 
was digested to completion with restriction enzymes Aatll and Clal, and a 2774 bp Aatll- 
Clal fragment isolated This fragment was ligated into Aatll and Clal cut pGEM-7Zf vector 
(Promega), generating pWl-18. This plasmid was digested with Hindll and ligated to 
annealed oligonucleotides lad (SEQ ID No:21) and lac2 (SEQ ID No:22) generating pWl- 
18S* which has a modified polylinker downstream of the Clal restriction site. This construct 
(pWl-18S*) was digested with Clal and Bglll and ligated with both the 2.5 kb 3' Clal - Bgfll 
exon-intron region and 5.5 kb 3' Bglll -Bglll Wnt-1 enhancer, generating pWRES4. This 
construct contains a 10.5 kb genomic region which starts upstream of the Wnt-1 translation 
initiation codon (at an Aatll site approximately l.Okb from the ATG) and extends to a BgUl 
site 5.5 kb downstream of the Wnt-l poiyadenylation signal. This plasmid also contains a 250 
bp region of the neomycin phosphotransferase (neo) gene inserted in inverse orientation in 
the y transcribed but untranslated region. Finally, to generate the WEXP2 expression vector, 
a 2 kb Sfi I fragment was amplified from pWRES4 using Sf-1 (SEQ ID No:23) and Sf-2 
(SEQ ID No:24) oligonucleotides. This amplified fragment was digested with Sfi I and 
inserted into Sfi I linearised pWRES4, generating pWEXP2. This destroys the Wnt-l 
translation initiation codon, and replaces it by a polylinker containing Nru I, Eco RV, Sac II, 
and Bst BI restriction sites, which are unique in pWEXP2. 

The WEXP2 - lacZ construct was obtained by inserting an end-filled Bgl II - Xho I 
lacZ fragment isolated from the pSDKlacZpA vector in the Nru I cut pWEXP2 expression 
vector. Similarly, the WEXP2 - CShh construct was obtained by inserting an end-filled Xbal 
cDNA fragment containing the full Chick Shh coding sequence (SEQ ID No:l) into the Nru I 
cut WEXP2 expression vector. 

Oligonucleotide sequences are as follows: 

lacl: 5 , -AGCTGTCGACGCGGCCGCTACGTAGGTTACCGACGTCAAGCTTAGATCTC-3 r 
lac2: S'-AGCTGAGATCTAAGCTTGACGTCGGTAACCTACGTAGCGGCCGCGTCGAC-S' 
Sf-1 : 5-GATCGGCCAGGCAGGCCTCGCGATATCGTCACCGCGGTATTCGAA-3 f 
Sf-2: 5'-AGTGCCAGTCGGGGCCCCCAGGGCCGCGCC-3 r 

Production And Genotyping Of Transgenic Embryos 

Transgenic mouse embryos were generated by microinjection of linear DNA 
fragments into the male pronucleus of B6CBAF1/J (C57BL/6J X CBA/J) zygotes. CD-I or 
B6CBAF1/J females were used as recipients for injected embryos. G 0 mice embryos were 
collected at 9.5, 10.5, and 11.5 dpc, photographed using an Olympus SZH stereophoto- 
microscope on Kodak EPY-64T color slide film, then processed as described earlier. 

WEXP2-/acZ and WEXP2-CShh transgenic embryos were identified by PCR analysis 
of proteinase-K digests of yolk sacs. Briefly, yolk sacs were carefully dissected free from 
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maternal and embryonic tissues, avoiding cross-contamination between littermates, then 
washed once in PBS. After overnight incubation at 55°C in 50 ul of PCR proteinase-K 
digestion buffer (McMahon, A.P. et al., (1990) Cell 62: 1073-1085). 1 ul of heat-inactivated 
digest was subjected to polymerase chain reaction (PCR) in a 20 pi volume for 40 cycles as 
follows: 94°C for 30 seconds, 55°C for 30 seconds, 72°C for 1 minute, with the reaction 
ingredients described previously (McMahon, A.P. et al., (1990) Cell 62: 1073-1085)). In the 
case of the WEXP2 - lacZ transgenic embryos, oligonucleotides 137 (SEQ ID No:25) and 
138 (SEQ ID No:26) amplify a 352 bp lacZ specific product. In the case of the WEXP2- 
CShh embryos, oligonucleotides WPR2 (^/-/-specific) (SEQ ID No:27) and 924 (Chick 
SM-specific) (SEQ ID No:28) amplify a 345 bp fragment spanning the insertion junction of 
the CYackrShh cDNA in the WEXP2 expression vector. Table 2 summarizes the results of 
WEXP2-C-SM transgenic studies. 

Oligonucleotide sequences are as follows: 
137: 5'-TACCACAGCGGATGGTTCGG-3' 
138: 5' -GTGGTGGTTATGCCGATCGC-3' 
WPR2: 5'-TAAGAGGCCTATAAGAGGCGG-3' 
924: 5'-AAGTCAGCCCAGAGGAGACT-3' 

(ii) Mouse hh Genes 

The combined screening of mouse genomic and 8.5 day post coitum (dpc) cDNA 
libraries identified three mammalian hh counterparts (Figure 5A) which herein will be 
referred to as Desert, Indian and Sonic hedgehog {Dhh, Ihh and Shh, respectively). 
Sequences encoding Dhh (SEQ ID No:2) were determined from analysis of clones identified 
by low stringency screening of a mouse genomic library. DNA sequencing of one of five 
overlapping lambda phage clones identified three homologous regions encoding a single 
open reading frame interrupted by introns in identical position to those of the Drosophila hh 
gene (Figure 5A). Splicing across the exon 1/2 boundary was confirmed by polymerase 
chain reaction (PCR) amplification of first strand cDNA generated from adult testicular 
RNA. The partial sequence of Mi (SEQ ID No:3) and the complete sequence of Shh (SEQ 
ID No:4) coding regions were determined from the analysis of overlapping cDNA clones 
isolated from 8.5 dpc cDNA libraries. The longest Shh clone, 2.6kb, appears to be full length 
when compared with the Shh transcript present in embryonic RNAs. The 1 .8kb partial length 
Ihh cDNA is complete at the 3* end, as evidenced by the presence of a polyadenylation 
consensus sequence and short poly A tail. 

Alignment of the predicted Drosophila hh protein sequence (SEQ ID No:34) with 
those of the mouse Dhh (SEQ ID No:9), Ihh (SEQ ID No:10) and Shh (SEQ ID No:l 1), and 
chick Shh (SEQ ID No:8) and zebrafish Shh (SEQ ID No: 12), reveals several interesting 
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features of the Wi-family (Figure 5A). All the vertebrate M-proteins contain an amino 
terminal hydrophobic region of approximately 20 amino acids immediately downstream of 
the initiation methioniiie. Although the properties of these new hh proteins have not been 
investigated, it is likely that this region constitutes a signal peptide and vertebrate hhs are 
secreted proteins. Signal peptide cleavage is predicted to occur (von Heijne, G., (1986) 
Nucleic Acids Research 14: 4683-4690) just before an absolutely conserved six amino acid 
stretch, CGPGRG (SEQ ID No:29) (corresponding to residues 85-90)(Figure 5A), in all hh 
proteins. This generates processed mouse Dhh (SEQ ID No:9) and Shh (SEQ ID No: 11) 
proteins of 41 and 44 kd, respectively. Interestingly, Drosophila hh (SEQ ID No:34) is 
predicted to contain a substantial amino terminal extension beyond the hydrophobic domain 
suggesting that the Drosophila protein enters the secretory pathway by a type II secretory 
mechanism. This would generate a transmembrane tethered protein which would require 
subsequent cleavage to release a 43 kd secreted form of the protein. In vitro analysis of 
Drosophila hh is consistent with this interpretation (Lee, J J. et al., (1992) Cell 71: 33-50). 
However, there also appears to be transitional initiation at a second methionine (position 51 
of SEQ ID No:34) just upstream of the hydrophobic region (Lee, JJ. et aL, (1992) Cell 71: 
33-50), suggesting that Drosophila hh 9 like its vertebrate counterparts, may also be secreted 
by recognition of a conventional amino terminal signal peptide sequence. 

Data base searches for protein sequences related to vertebrate hh's failed to identify 
any significant homologies, excepting Drosophila hh. In addition, searching the "PROSITE" 
data bank of protein motifs did not reveal any peptide motifs which are conserved in the 
different hh proteins. Thus, the hhs represent a novel family of putative cell signaling 
molecules. 

One feature of the amino acid alignment is the high conservation of hh sequences. 
Vertebrate hhs share 47 to 51% amino acid identity with Drosophila hh throughout the 
predicted processed polypeptide sequence (Figure 6). Dhh has a slightly higher identity than 
that of Ihh and Shh suggesting that Dhh may be the orthologue of Drosophila hh. 
Conservation is highest in the amino terminal half of the proteins, indeed, from position 85 
(immediately after the predicted shared cleavage site) to 249, 62% of the amino acids are 
completely invariant amongst the Drosophila and vertebrate proteins. Comparison of mouse 
Dhh, Ihh and Shh where their sequences overlap in this more conserved region, indicates that 
Ihh and Shh are more closely related (90% amino acid identity; residues 85 to 266) than with 
the Dhh sequence (80% amino acid identity; residues 85 to 266). Thus, Ihh and Shh 
presumably resulted from a more recent gene duplication event. 

Comparison of cross species identity amongst Shh proteins reveals an even more 
striking sequence conservation. Throughout the entire predicted processed sequence mouse 
and chick Shh share 84% of amino acid residues (Figure 6). However, in the amino terminal 
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half (positions 85 to 266) mouse and chick are 99% and mouse and zebrafish 94% identical 
in an 180 amino acid stretch. Conservation falls off rapidly after position 266 (Figure 5 A). 
SEQ ID No:40 shows the consensus sequence in the amino terminal half of all vertebrate Shh 
genes (human, mouse, chicken and zebrafish) identified to date. SEQ ID No:41 shows the 
consensus sequence in the amino terminal half of vertebrate hedgehog genes (Shh, Ihh, and 
Dhh) identified to date in different species (mouse, chicken, human and zebrafish). 

In summary, hh family members are likely secreted proteins consisting of a highly 
conserved amino terminal and more divergent carboxyl terminal halves. The extreme 
interspecies conservation of the vertebrate Shh protein points to likely conservation of Shh 
function across vertebrate species. 

(Hi) Expression of Mouse Shh at the Axial Midline 

Expression of Shh in the mouse was examined in order to explore the role of mouse 
Shh (SEQ ID No:l 1) in vertebrate development. Northern blots of embryonic and adult RNA 
samples were probed with a radiolabeled mouse Shh cDNA probe. An Shh transcript of 
approximately 2.6kb was detected in 9.5dpc whole embryo RNA, and 9.5 and 10.5dpc brain 
RNA fractions. No expression was detected in total RNA samples from later embryonic 
stages. Of the late fetal and adult tissue RNAs examined Shh expression was only detected in 
16.5dpc and adult lung. 

To better define the precise temporal and spatial expression of Shh an extensive series 
of whole mount and serial section in situ hybridizations were performed using digoxygenin 
and 35s-radioIabelled RNA probes, respectively, to mouse embryo samples from 7.25dpc 
(mid streak egg cylinder stage of gastrulation) to 13.5dpc. No Shh expression is detected at 
mid-gastrulation stages (7.25dpc) prior to the appearance of the node, the mouse counterpart 
of the amphibian organizer and chick Hensen's node. When the primitive streak is fully 
extended and the midline mesoderm of the head process is emerging from the node (7.5 to 
7.75dpc), Shh is expressed exclusively in the head process. At late head fold stages, Shh is 
expressed in the node and midline mesoderm of the head process extending anteriorly under 
the presumptive brain. Just prior to somite formation, Shh extends to the anterior limit of the 
midline mesoderm, underlying the presumptive midbrain. As somites are formed, the 
embryonic axis extends caudally. The notochord, which represents the caudal extension of 
the head process, also expresses Shh 9 and expression is maintained in the node. 

Interestingly, by 8 somites (8.5dpc) strong Shh expression appears in the CNS. 
Expression is initiated at the ventral midline of the midbrain, above the rostral limit of the 
head process. By 10 somites CNS expression in the midline extends rostrally in the forebrain 
and caudally into the hindbrain and rostral spinal cord. Expression is restricted in the 
hindbrain to the presumptive floorplate, whereas midbrain expression extends ventro- 
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laterally. In the forebrain, there is no morphological floor plate, however ventral Shh 
expression here is continuous with the midbrain. By 15 somites ventral CNS expression is 
continuous from the rostral limit of the diencephalon to the presumptive spinal cord in 
somitic regions. Over the next 18 to 24 hrs, to the 25-29 somite stage, CNS expression 
intensifies and forebrain expression extends rostral to the optic stalks. In contrast to all other 
CNS regions, in the rostral half of the diencephalon, Shh is not expressed at the ventral 
midline but in two strips immediately lateral to this area which merge again in the floor of the 
forebrain at its rostral limit. Expression of Shh in both the notochord and floorplate is 
retained until at least 13.5dpc. 

Several groups have recently reported the cloning and expression of vertebrate 
members of a family of transcription factors, related to the Drosophila forkhead gene. One 
of these, HNF-3$ shows several similarities in expression to Shh (Sasaki, H. et al., (1993) 
Development 118: 47-59) suggesting that HNF-3$ may be a potential regulator of Shh. To 
investigate this possibility, direct comparison of HNF-3$ and Shh expression was 
undertaken. HNF-3$ transcripts are first detected in the node (as previously reported by 
Sasaki, H. et al., (1993) supra), prior to the emergence of the head process and before Shh is 
expressed. From the node, expression proceeds anteriorly in the head process, similar to Shh 
expression. Activation of HNF-3$ within the CNS is first observed at 2-3 somites, in the 
presumptive mid and hindbrain, prior to the onset of Shh expression. By 5 somites, 
expression in the midbrain broadens ventro-laterally, extends anteriorly into the forebrain and 
caudally in the presumptive floor plate down much of the neuraxis in the somitic region. 
Strong expression is maintained at this time in the node and notochord. However, by 10 
somites expression in the head process is lost and by 25-29 somites notochordal expression is 
only present in the most extreme caudal notochord. In contrast to the transient expression of 
HNF~3$ in the midline mesoderm, expression in the floor plate is stably retained until at least 
ll.Sdpc. Thus, there are several spatial similarities between the expression of HNF-3§ and 
Shh in both the midline mesoderm and ventral CNS and it is likely that both genes are 
expressed in the same cells. However, in both regions, HNF-3$ expression precedes that of 
Shh. The main differences are in the transient expression of HNF-3$ in the head process and 
notochord and Shh expression in the forebrain. Whereas HNF-3$ and Shh share a similar 
broad ventral and ventral lateral midbrain and caudal diencephalic expression, only Shh 
extends more rostrally into the forebrain. In general, these results are consistent with a model 
in which initial activation of Shh expression may be regulated by HNF-3$. 

The similarity in Shh and HNF-3$ expression domains is also apparent in the 
definitive endoderm which also lies at the midline. Broad HNF-3$ expression in the foregut 
pocket is apparent at 5 somites as previously reported by Sasaki, H. et al., (1993) supra. Shh 
is also expressed in the endoderm, immediately beneath the forebrain. Both genes are active 
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in the rostral and caudal endoderm from 8 to 1 1 somites. Whereas HNF-3$ is uniformly 
expressed, Shh expression is initially restricted to two ventrolateral strips of cells. Ventral 
restricted expression of Shh is retained in the most caudal region of the presumptive gut until 
at least 9.5dpc whereas HNF-3& is uniformly expressed along the dorso-ventral axis. Both 
genes are expressed in the pharyngeal ectoderm at 9.5dpc and expression is maintained in the 
gut until at least ll.Sdpc. Moreover, expression of Shh in the embryonic and adult lung RNA 
suggests that endodermal expression of Shh may continue in, at least some endoderm derived 
organs. 

(iv) Expression Of Shh In The Limb 

Expression of Shh is not confined to midline structures. By 30-35 somites (9.75dpc), 
expression is detected in a small group of posterior cells in the forelimb bud. The forelimb 
buds form as mesenchymal outpocketings on the flanks, opposite somites 8 to 12, at 
approximately the 17 to 20 somite stage. Shh expression is not detectable in the forelimbs 
until about 30-35 somites, over 12 hours after the initial appearance of the limbs. Expression 
is exclusively posterior and restricted to mesenchymal cells. By 10.5dpc, both the fore and 
hindlimbs have elongated substantially from the body flank. At this time Shh is strongly 
expressed in the posterior, distal aspect of both limbs in close association with the overlying 
ectoderm. Analysis of sections at this stage detects Shh expression in an approximately six 
cell wide strip of posterior mesenchymal cells. In the forelimb, Shh expression ceases by 
ll.Sdpc. However, posterior, distal expression is still detected in the hindlimb. No limb 
expression is detected beyond 12.5dpc. 

(v) Ectopic Expression Of Shh 

Grafting studies carried out principally in the chick demonstrate that cell signals 
derived from the notochord and floor plate pattern the ventral aspect of the CNS (as described 
above). In the limb, a transient signal produced by a group of posterior cells in both limb 
buds, the zone of polarizing activity (ZPA), is thought to regulate patterning across the 
anterior-posterior axis. Thus, the sequence of Shh 9 which predicts a secreted protein and the 
expression profile in midline mesoderm, the floor plate and in the limb, suggest that Shh 
signaling may mediate pattern regulation in the ventral CNS and limb. 

To determine whether Shh may regulate ventral development in the early mammalian 
CNS, a Wnt~l enhancer was used to alter its normal domain of expression. Wnt~l shows a 
dynamic pattern of expression which is initiated in the presumptive midbrain just prior to 
somite formation. As the neural folds elevate and fuse to enclose the neural tube, Wnt-1 
expression in the midbrain becomes restricted to a tight circle, just anterior of the midbrain, 
the ventral midbrain and the dorsal midline of the diencephalon, midbrain, myelencephalon 
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and spinal cord (Wilkinson, D.G. et al., (1987) Cell 50: 79-88; McMahon, A.P. et al., (1992) 
Cell 69: 581-595; Parr, BA et al., (1993) Development 1 19: 247-261). 

It was determined that essentially normal expression of lacZ reporter constructs 
within the Wnt-l expression domain is dependent upon a 5.5kb enhancer region which lies 
downstream of the Wnt-l polyadenylation sequence. A construct was generated for ectopic 
expression of cDNA clones in the Wnt-l domain and tested in transgenics using a lacZ 
reporter (pWEXP-lacZ; Figure 9). Two of the four G c transgenic embryos showed readily 
detectable p-galactosidase activity, and in both expression occurred throughout the normal 
Wnt-l expression domain. More extensive studies with a similar construct also containing the 
5.5kb enhancer gave similar frequencies. Some ectopic expression was seen in newly 
emerging neural crest cells, probably as a result of perdurance of p-galactosidase RNA or 
protein in the dorsally derived crest. Thus, the Wnt-l expression construct allows the efficient 
ectopic expression of cDNA sequences in the midbrain and in the dorsal aspect of much of 
the CNS. 

An Shh ectopic expression construct (pWEXP-GSftfc) containing two tandem head to 
tail copies of a chick Shh cDNA was generated (Figure 7). By utilizing this approach, ectopic 
expression of the chick Shh is distinguishable from that of the endogenous mouse Shh gene. 
Chick Shh shows a high degree of sequence identity and similar expression to the mouse 
gene. Thus, it is highly likely that Shh function is widely conserved amongst vertebrates, a 
conclusion further supported by studies of the same gene in zebrafish. 

Table 2 shows the results of several transgenic experiments in which the G Q 
population was collected at 9.5 to 11.5dpc. Approximately half of the transgenic embryos 
identified at each stage of development had a clear, consistent CNS phenotype. As we 
expect, on the basis of control studies using the 5.5kb Wnt-l enhancer, that only half the 
transgenics will express the transgene, it is clear that in most embryos ectopically expressing 
chick Shh, an abnormal phenotype results. 

TABLE 2 

Summary of WEXP2-Chick Shh transgenic studies 



Age (dpc) 


Number of 
Embryos 


Number of 
Transgenics 


Number of Embryos with 
CNS phenotype 3 


9.5 


37 


11 


6 (54.5%) 


10.5 


59 


16 


8 (50%) 


11.5 


33 


7 


3 (42.9%) 



a In addition one 9.5pc and two 10.5pc transgenic embryos showed non-specific growth retardation, 
as occurs at low frequency in transgenic studies. These embryos were excluded from further analysis. 
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At 9.5dpc, embryos with a weaker phenotype show an open neural plate from the mid 
diencephalon to the myelencephalon. In embryos with a stronger phenotype at the same 
stage, the entire diencephalon is open and telencephalic and optic development is 
morphologically abnormal. As the most anterior diencephalic expression of Wnt-l is lower 
than that in more caudal regions, the differences in severity may relate to differences in the 
level of chick Shh expression in different G Q embryos. At the lateral margins of the open 
neural folds, where Wnt-l is normally expressed, there is a thickening of the neural tissue 
extending from the diencephalon to myelencephalon. The cranial phenotype is similar at 
10.5 and 11.5 dpc. However, there appears to be a retardation in cranial expansion of the 
CNS at later stages. 

In addition to the dorsal cranial phenotype, there is a progressive dorsal phenotype in 
the spinal cord. At 9.5 dpc, the spinal cord appears morphologically normal, except at 
extreme rostral levels. However by 10.5dpc, there is a dorsal dysmorphology extending to the 
fore or hindlimbs. By ll.Sdpc, all transgenic embryos showed a dorsal phenotype along 
almost the entire spinal cord. Superficially, the spinal cord had a rippled, undulating 
appearance suggestive of a change in cell properties dorsally. This dorsal phenotype, and the 
cranial phenotype were examined by histological analysis of transgenic embryos. 

Sections through a 9.5dpc embryo with an extreme CNS phenotype show a 
widespread dorsal perturbation in cranial CNS development. The neural/ectodermal junction 
in the diencephalon is abnormal. Neural tissue, which has a columnar epithelial morphology 
quite distinct from the squamous epithelium of the surface ectoderm, appears to spread 
dorsolaterally. The myelencephalon, like the diencephalon and midbrain, is open rostrally. 
Interestingly, there are discontinuous dorso-lateral regions in the myelencephalon with a 
morphology distinct from the normal roof plate regions close to the normal site of Wnt-l 
expression. These cells form a tight, polarized epithelium with basely located nuclei, a 
morphology similar to the floor plate and distinct from other CNS regions. Differentiation of 
dorsally derived neural crest occurs in transgenic embryos as can be seen from the presence 
of cranial ganglia. In the rostral spinal cord, the neural tube appeared distended dorso- 
laterally which may account for the superficial dysmorphology. 

By 1 l.Sdpc, CNS development is highly abnormal along the entire dorsal spinal cord 
to the hindlimb level. The dorsal half of the spinal cord is enlarged and distended. Dorsal 
sensory innervation occurs, however, the neuronal trajectories are highly disorganized. Most 
obviously, the morphology of dorsal cells in the spinal cord, which normally are elongated 
cells with distinct lightly staining nuclei and cytoplasm, is dramatically altered. Most of the 
dorsal half of the spinal cord consists of small tightly packed cells with darkly staining nuclei 
and little cytoplasm. Moreover, there appears to be many more of these densely packed cells, 
leading to abnormal outgrowth of the dorsal CNS. In contrast, ventral development is 
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normal, as are dorsal root ganglia, whose origins lie in neural cells derived from the dorsal 
spinal cord. 

(vi) Ectopic Shh Expression Activates Floor Plate Gene Expression 

To determine whether ectopic expression of chick Shh results in inappropriate 
activation of a ventral midline development in the dorsal CNS, expression of two floor plate 
expressed genes, HNF-3p and mouse Shh, were examined. Whole mounts of 9.5dpc 
transgenic embryos show ectopic expression of HNF-3p throughout the cranial WntA 
expression domain. In addition to normal expression at the ventral midline, HNF-3P 
transcripts are expressed at high levels, in a circle just rostral to the mid/hindbrain junction, 
along the dorsal (actually lateral in unfused brain folds) aspects of the midbrain and, more 
weakly, in the roof plate of the myelencephalon. No expression is observed in the 
metencephalon which does not express WntA, Thus, ectopic expression of Shh leads to the 
activation of HNF-3P throughout the cranial Wnt-l expression domain. 

The relationship between chick Shh expression and the expression of HNF-3p in 
serial sections was also examined. Activation of HNF-3p in the brain at 9.5 and 10.5dpc is 
localized to the dorsal aspect in good agreement with the observed ectopic expression of 
chick Shh. Interestingly mouse Shh is also activated dorsally. Thus, two early floor plate 
markers are induced in response to chick Shh. 

From 9.5dpc to lL5dpc, the spinal cord phenotype becomes more severe. The 
possibility that activation of a floor plate pathway may play a role in the observed phenotype 
was investigated. In contrast to the brain, where ectopic HNF-3p and Shh transcripts are still 
present, little or no induction of these floor plate markers is observed. Thus, although the 
dorsal spinal cord shows a widespread transformation in cellular phenotype, this does not 
appear to result from the induction of floor plate development 

Example 3 

Chick Sonic Hedgehog Mediates ZPA Activity 

(i) Experimental Procedures 

Retinoic Acid Bead Implants 

Fertilized white Leghorn chicken eggs were incubated to stage 20 and then implanted 
with AG1-X2 ion exchange beads (Biorad) soaked in 1 mg/ml retinoic acid (RA, Sigma) as 
described by Tickle, C. et aL, (1985) Dev. Biol 109: 82-95. Briefly, the beads were soaked 
for 15 min in 1 mg/ml RA in DMSO, washed twice and implanted under the AER on the 
anterior margin of the limb bud. After 24 or 36 hours, some of the implanted embryos were 
harvested and fixed overnight in 4% paraformaldehyde in PBS and then processed for whole 
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mount in situ analysis as previously described. The remainder of the animals were allowed to 
develop to embryonic day 10 to confirm that the dose of RA used was capable of inducing 
mirror image duplications. Control animals were implanted with DMSO soaked beads and 
showed no abnormal phenotype or gene expression. 

5 

Plasmids 

Unless otherwise noted, all standard cloning techniques were performed according to 
Ausubel, F.M. et ah, (1989) Current Protocols in Molecular Biology (N.Y.: Greene 
Publishing Assoc. and Wiley Inerscience), and all enzymes were obtained from Boehringer 
10 Mannheim Biochemicals. pHH-2 is a cDNA contain the entire coding region of chicken 
Sonic hedgehog (SEQ ID No:l). RCASBP(A) and RCASBP(E) are replication-competent 
retroviral vectors which encode viruses with differing host ranges. RCANBP(A) is a variant 
of RCASBP(A) from which the second splice acceptor has been removed. This results in a 
gj virus which can not express the inserted gene and acts as a control for the effects of viral 

PJ 15 infection (Hughes, S.H. et al., (1987) J. Virol 61: 3004-3012; Fekete, D. et al., (1993) Mol 
J Cell Biol 13: 2604-2613). RCASBP/AP(E) is version of RCASBP(E) containing a human 

W placental alkaline phosphatase cDNA (Fekete, D. et al., (1993b) Proa Natl Acad. Sci. USA 

jg 90: 2350-2354). SLAX13 is a pBluescript SK+ derived plasmid with a second Cla I 

9 restriction site and the 5' untranslated region of v-src (from the adaptor plasmid CLA12-Nco, 

5 20 Hughes, S.H. et al., (1987) J. Virol 61: 3004-3012) cloned 5' of the EcoRI (and Clal) site in 
P the pBluescript polylinker. RCASBP plasmids encoding Sonic from either the first (Ml) or 

R* 1 second (M2) methionine (at position 4) were constructed by first shuttling the L7kb Sonic 

p fragment of pHH-2 into SLAX-13 using oligonucleotides to modify the 5* end of the cDNA 

such that either the first or second methionine is in frame with the Ncol site of SLAX-13. 
25 The amino acid sequence of Sonic is not mutated in these constructs. The Ml and M2 Sonic 
Clal fragments (v-src S'lTTRSonic) were each then subcloned into RCASBP(A), 
RCANBP(A) and RCASBP(E), generating Sonic/RCAS-Al, Sonic/RCAS-A2 5 
Sonic/RCAN-Al, Sonic/RCAN-A2, Sonic/RCAS-El and Sonic/RCAS-E2. 

Chick Embryos, Cell Lines And Virus Production 

30 All experimental manipulations were performed on standard specific-pathogen free 

White Leghorn chick embryos (S-SPF) from closed flocks provided fertilized by SPAFAS 
(Norwich, Conn). Eggs were incubated at 37.5°C and staged according to Hamburger, V. et 
al., (1951) 7. Exp. Morph 88: 49-92. All chick embryo fibroblasts (CEF) were provided by 
C. Cepko. S-SPF embryos and CEFs have previously been shown to be susceptible to 

35 RCASBP(A) infection but resistant to RCASBP(E) infection (Fekete, D. et al., (1993b) Proc. 
Natl Acad. Set USA 90: 2350-2354). Line 15b CEFs are susceptible to infection by both 
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RCASBP(A) and (E). These viral host ranges were confirmed in control experiments. CEF 
cultures were grown and transfected with retroviral vector DNA as described (Morgan, B.A. 
et aL, (1993) Nature 358: 236-239; Fekete, D. et al., (1993b) Proc. Natl Acad Set USA 90: 
2350-2354). All viruses were harvested and concentrated as previously described (Morgan, 
5 B.A. et al., (1993) Nature 358: 236-239; Fekete, D. et al., (1993b) Proc. Natl Acad Set USA 
90: 2350-2354) and had titers of approximately 10** cfu/ml. 

Cell Implants 

A single 60mm dish containing line 15b CEFs which had been infected with either 
RCASBP/AP(E), Sonic/RCAS-El or Sonic/RCAS-E2 were grown to 50-90% confluence, 
10 lightly trypsinized and then spun at 1000 rpm for 5 min in a clinical centrifuge. The pellet 
was resuspended in 1 ml media, transferred to a microcentrifuge tube and then 
microcentrifuged for 2 min at 2000 rpm. Following a 30 min incubation at 37° C, the pellet 
was respun for 2 min at 2000 rpm and then lightly stained in media containing 0.01% nile 
p blue sulfate. Pellet fragments of approximately 300^im x lOO^m x 50jxm were implanted as 

|f 15 a wedge to the anterior region of hh stage 19-23 wing buds (as described by Riley, B.B. et aL, 
% 4 (1993) Development 118: 95-104). At embryonic day 10, the embryos were harvested, fixed 

*p in 4% paraformaldehyde in PBS, stained with alcian green, and cleared in methyl salicylate 

O (Tickle, C. et al., (1985) Dev. Biol 109: 82-95). 

s9 



O 



Viral Infections 



H 20 Concentrated Sonic/RCAS-A2 or Sonic/RCAN-A2 was injected under the AER on 

P the anterior margin of stage 20-22 wing buds. At 24 or 36 hours post-infection, the embryos 

^ were harvested, fixed in 4% paraformaldehyde in PBS and processed for whole mount in situ 

analysis as previously described. 

(ii) Co-Localization Of Sonic Expression And Zpa Activity 

25 ZPA activity has been carefully mapped both spatially and temporally within the limb 

bud (Honig, L.S. et al., (1985) J. Embryol exp. Morph 87: 163-174). In these experiments 
small blocks of limb bud tissue from various locations and stages of chick embiyogenesis 
(Hamburger, V et al., (1951) J. Exp. Morph 88: 49-92) were grafted to the anterior of host 
limb buds and the strength of ZPA activity was quantified according to degrees of digit 

30 duplication. Activity is first weakly detected along the flank prior to limb bud outgrowth. 
The activity first reaches a maximal strength at stage 19 in the proximal posterior margin of 
the limb bud. By stage 23 the activity extends the full length of the posterior border of the 
limb bud. The activity then shifts distally along the posterior margin so that by stage 25 it is 
no longer detectable at the base of the flank. The activity then fades distally until it is last 

35 detected at stage 29. 
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This detailed map of endogenous polarizing activity provided the opportunity to 
determine the extent of the correlation between the spatial pattern of ZPA activity and Sonic 
expression over a range of developmental stages. Whole mount in situ hybridization was 
used to assay the spatial and temporal pattern of Sonic expression in the limb bud. Sonic 
expression is not detected until stage 17, at the initiation of limb bud formation, at which 
time it is weakly observed in a punctate pattern reflecting a patchy expression in a few cells. 
From that point onwards the Sonic expression pattern exactly matches the location of the 
ZPA, as determined by Honig, L.S. et al., (1985) J. Embryol exp. Morph 87: 163-174, both 
in position and in intensity of expression. 

(Hi) Induction Of Sonic Expression By Retinoic Acid 

A source of retinoic acid placed at the anterior margin of the limb bud will induce 
ectopic tissue capable causing mirror-image duplications (Summerbell, D. et al., (1983) In 
Limb Development and Regeneration (N.Y.: Ala R. Liss) pp. 109-118; Wanek, N. et al., 
(1991) Nature 350: 81-83). The induction of this activity is not an immediate response to 
retinoic acid but rather takes approximately 18 hours to develop (Wanek, N. et al., (1991) 
Nature 350: 81-83). When it does develop, the polarizing activity is not found surrounding 
the implanted retinoic acid source, but rather is found distal to it in the mesenchyme along 
the margin of the limb bud (Wanek, N. et al., (1991) Nature 350: 81-83). 

If Sonic expression is truly indicative of ZPA tissue, then it should be induced in the 
ZPA tissue which is ectopically induced by retinoic acid. To test this, retinoic acid-soaked 
beads were implanted in the anterior of limb buds and the expression of Sonic after various 
lengths of time using whole-mount in situ hybridization was assayed. As the limb bud 
grows, the bead remains imbedded proximally in tissue which begins to differentiate. 
Ectopic Sonic expression is first detected in the mesenchyme 24 hours after bead 
implantation. This expression is found a short distance from the distal edge of the bead. By 
36 hours Sonic is strongly expressed distal to the bead in a stripe just under the anterior 
ectoderm in a mirror-image pattern relative to the endogenous Sonic expression in the 
posterior of the limb bud. 

(iv) Effects Of Ectopic Expression Of Sonic On Limb Patterning 

The normal expression pattern of Sonic, as well as that induced by retinoic acid, is 
consistent with Sonic being a signal produced by the ZPA. To determine whether Sonic 
expression is sufficient for ZPA activity, the gene was ectopically expressed within the limb 
bud. In most of the experiments we have utilized a variant of a replication-competent 
retroviral vector called RCAS (Hughes, S.H. et al., (1987) 1 Virol 61: 3004-3012)) both as a 
vehicle to introduce the Sonic sequences into chick cells and to drive their expression. The 
fact that there exists subtypes of avian retroviruses which have host ranges restricted to 
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particular strains of chickens was taken advantage of to control the region infected with the 
Sonic/RCAS virus (Weiss, R. (et al.) (1984) RNA Tumor Viruses, Vol. 1 Weiss et al. eds., 
(N.Y.: Cold Spring Harbor Laboratories) pp. 209-260); Fekete, D. et al., (1993a) Mol Cell 
Biol 13: 2604-2613). Thus a vector with a type E envelope protein (RCAS-E, Fekete, D. et 
5 al., (1993b) Proa Natl Acad Scl USA 90: 2350-2354) is unable to infect the cells of the 
SPAFAS outbred chick embryos routinely used in our lab. However, RCAS-E is able to 
infect cells from chick embryos of line 15b. In the majority of experiments, primary chick 
embryo fibroblasts (CEFs) prepared from line 15b embryos in vitro were infected. The 
infected cells were pelleted and implanted into a slit made in the anterior of S-SPF host limb 
10 buds. Due to the restricted host range of the vector, the infection was thus restricted to the 
graft and did not spread through the host limb bud. 

To determine the fate of cells implanted and to control for any effect of the implant 
procedure, a control RCAS-E vector expressing human placental alkaline phosphatase was 
used. Alkaline phosphatase expression can be easily monitored histochemically and the 
(3 15 location of infected cells can thus be conveniently followed at any stage. Within 24 hours 
Fl following implantation the cells are dispersed proximally and distally within the anterior 

^ margin of the limb bud. Subsequently, cells are seen to disperse throughout the anterior 

111 portion of the limb and into the flank of the embryo. 

Q v 

Limb buds grafted with alkaline phosphatase expressing cells or uninfected cells give 
q 20 rise to limbs with structures indistinguishable from unoperated wild type limbs. Such limbs 
N have the characteristic anterior-to-posterior digit pattern 2-3-4. ZPA grafts give rise to a 

^ variety of patterns of digits depending on the placement of the graft within the bud (Tickle, 

2 C. et al., (1975) Nature 254: 199-202) and the amount of tissue engrafted (Tickle, C. (1981) 

9* Nature 289: 295-298). In some instances the result can be as weak as the duplication of a 

25 single digit 2. However, in optimal cases the ZPA graft evokes the production of a full 
mirror image duplication of digits 4-3-2-2-3-4 or 4-3-2-3-4 (see Figure 8). A scoring system 
has been devised which rates the effectiveness of polarizing activity on the basis of the most 
posterior digit duplicated: any graft which leads to the development of a duplication of digit 4 
has been defined as reflecting 100% polarizing activity (Honig, L.S. et al., (1985) 7. Embryol 
30 Exp. Morph 87: 163-174). 

Grafts of 1 5b fibroblasts expressing Sonic resulted in a range of ZPA-like phenotypes. 
In some instances the resultant limbs deviate from the wild type solely by the presence of a 
mirror-image duplication of digit 2. The most common digit phenotype resulting from 
grafting Sow/c-infected CEF cells is a mirror-image duplication of digits 4 and 3 with digit 2 
35 missing: 4-3-3-4. In many such cases the two central digits appear fused in a 4-3/3-4 pattern. 
In a number of the cases the grafts induced full mirror-image duplications of the digits 
equivalent to optimal ZPA grafts 4-3-2-2-3-4. Besides the digit duplications, the ectopic 
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expression of Sonic also gave rise to occasional duplications of proximal elements including 
the radius or ulna, the humerus and the coracoid. While these proximal phenotypes are not 
features of ZPA grafts, they are consistent with an anterior-to-posterior respecifi cation of cell 
fate. In some instances, most commonly when the radius or ulna was duplicated, more 
5 complex digit patterns were observed. Typically, an additional digit 3 was formed distal to a 
duplicated radius. 

The mirror-image duplications caused by ZPA grafts are not limited to skeletal 
elements. For example, feather buds are normally present only along the posterior edge of 
the limb. Limbs exhibiting mirror-image duplications as a result of ectopic Sonic expression 
10 have feather buds on both their anterior and posterior edges, similar to those observed in ZPA 
grafts. 

While ZPA grafts have a powerful ability to alter limb pattern when placed at the 
anterior margin of a limb bud, they have no effect when placed at the posterior margin 
P (Saunders, J.W. et al., (1968) Epithelial-Mesenchymal Interaction, Fleischmayer and 

|2 15 Billingham, eds. (Baltimore: Williams and Wilkins) pp. 78-97). Presumably, the lack of 
M posterior effect is a result of polarizing activity already being present in that region of the 

J bud. Consistent with this, grafts of Sonic expressing cells placed in the posterior of limb 

p buds never result in changes in the number of digits. Some such grafts did produce 

^ distortions in the shape of limb elements, the most common being a slight posterior curvature 

p 20 in the distal tips of digits 3 and 4 when compared to wild type wings. 

Q (v) Effect Of Ectopic Sonic Expression On Hoxd Gene Activity 

iS The correct expression of Hoxd genes is part of the process by which specific skeletal 

^ s elements are determined (Morgan, B.A. et al., (1993) Nature 358: 236-239). A transplant of 

a ZPA into the anterior of a chick limb bud ectopically activates sequential transcription of 
25 Hoxd genes in a pattern which mirrors the normal sequence of Hoxd gene expression (Nohno, 
T. et al., (1991) Cell 64: 1197-1205; Izpisua-Belmonte, J.C. et al., (1991) Nature 350: 585- 
589). Since ectopic Sonic expression leads to the same pattern duplications as a ZPA graft, 
we reasoned that Sonic would also lead to sequential activation of Hoxd genes. 

To test this hypothesis, anterior buds were injected with Sonic/RCAS-A2, a virus 
30 which is capable of directly infecting the host strains of chicken embryos. This approach 
does not strictly limit the region expressing Sonic (being only moderately controlled by the 
timing, location and titer of viral injection), and thus might be expected to give a more 
variable result. However, experiments testing the kinetics of viral spread in infected limb 
buds indicate that infected cells remain localized near the anterior margin of the bud for at 
35 least 48 hours. Hoxd gene expression was monitored at various times post infection by 
whole mount in situ hybridization. As expected, these genes are activated in a mirror-image 
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pattern relative their expression in the posterior of control limbs. For example, after 36 hours 
Hoxd-13 is expressed in a mirror-image symmetrical pattern in the broadened distal region of 
infected limb buds. Similar results were obtained with other Hoxd genes (manuscript in 
preparation). 



Example 4 

A Functionally Conserved Homolog of Drosophila Hedgehog is Expressed in 
Tissues With Polarizing Activity in Zebrafish Embryos 

(i) Experimental Procedures 

Cloning and Sequencing 

Approximately 1.5 x 10^ plaques of a 33h zebrafish embryonic Xgtl 1 cDNA library 
were screened by plaque hybridization at low stringency (McGinnis, W. et al., (1984) Nature 
308: 428-433) using a mix of two hh sequences as a probe: a Drosophila hh 400bp EcoRI 
fragment and a murine Ihh 264bp BamHI-EcoRI exon 2 fragment Four clones were isolated 
and subcloned into the EcoRI sites of pUC18 T3T7 (Pharmacia). Both strands of clone 8.3 
were sequenced using nested deletions (Pharmacia) and internal oligonucleotide primers. 
DNA sequences and derived amino acid sequences were analyzed using "Geneworks" 
(Intelligenetics) and the GCG software packages. 

PCR amplification 

Degenerate oligonucleotides hh5. 1 (SEQ ID No:30) and M3.3 (SEQ ID No:3 1) were 
used to amplify genomic zebrafish DNA 

hh 5.1: AG(CA)GITG(CT)AA(AG)GA(AG)(CA)(AG)I(GCT)IAA 
hh 3.3: CTCIACIGCIA(GA)ICK=(GT)IGCIA 

PCR was performed with an initial denaturation at 94°C followed by 35 cycles of 47°C for 1 
min, 72°C for 2min and 94°C for 1 min with a final extension at 72°C. Products were 
subcloned in pUC 1 8 (Pharmacia). 

In Situ Hybridization 

In situ hybridizations of zebrafish embryos were performed as described in Oxtoby, 
E. et al., (1993) Nuc. Acids REs. 21: 1087-1095 with the following modifications: Embryos 
were rehydrated in ethanol rather than methanol series; the proteinase K digestion was 
reduced to 5 min and subsequent washes were done in PBTw without glycine; the antibody 
was preadsorbed in PBTw, 2mg/ml BSA without sheep serum; and antibody incubation was 
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performed in PBTw, 2mg/ml BSA. Drosophila embryos were processed and hybridized as 
previously described. 



Histology 

Stained embryos were dehydrated through ethanol:butanol series, as previously 
described (Godsave, S.F. et al., (1988) Development 102: 555-566), and embedded in 
Fibrowax. 8^m sections were cut on an Anglian rotary microtome 

RNA Probe Synthesis 

For analysis of Shh expression, two different templates were used with consistent 
results; (i) phhfcj 8.3 linearized with Bgl II to transcribe an antisense RNA probe that 
excludes the conserved region, and (ii) phh[c] 8.3 linearized with Hind III to transcribe an 
antisense RNA that covers the complete cDNA. All in situ hybridizations were performed 
with the latter probe which gives better signal. Other probes were as follows: Axial Dral- 
linearised p6TlN (StrShle, U. et ah, (1993) Genes & Dev. 7: 1436-1446) using T3 RNA 
polymerase, gsc linearized with EcoRl and transcribed with T7: pax 2 Bam Hl-linearized 
pcF16 (Krauss, S. et al., (1991) Development 113: 1193-1206) using T7 RNA polymerase. 
In situ hybridizations were performed using labelled RNA at a concentration of 1 ng/ml final 
concentration. Antisense RNA probes were transcribed according to the manufacturer's 
protocol (DIG RNA Labelling Kit, BCL). 

Zebrafish Strains 

Wild type fish were bred from a founder population obtained from the Goldfish Bowl, 
Oxford. The mutant cyclops strain bl6 and the mutant notail strains bl60 and bl95 were 
obtained from Eugene, Oregon. Fish were reared at 28°C on a 14h light/1 Oh dark cycle. 

RNA Injections 

The open reading frame of Shh was amplified by PCR, using oligonucleotides 
5'-CTGCAGGGATCCACCATGCGGClTTTGACGAG-3 f (SEQ ID No:32), which contains 
a consensus Kozak sequence for translation initiation, and 5'-CTGCAGGGATC- 
CTTATTCCACACGAGGGATT-3 1 (SEQ ID No:33), and subcloned into the Bglll site of 
pSP64T (Kreig, P.A. et al., (1984) NucAcids Res. 12: 7057-7070). This vector includes 5' 
and 3' untranslated Xenopus p-Globin sequences for RNA stabilization and is commonly 
used for RNA injections experiments in Xenopus. In vitro transcribed Shh RNA at a 
concentration of approximately 100 (ig/ml was injected into a single cell of naturally 
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spawned zebrafish embryos at one-cell to 4-cell stages using a pressure-pulsed Narishige 
microinjector. The injected volume was within the picolitre range. Embryos were fixed 20 
to 27 hrs after injection in BT-Fix (Westerfield, M. (1989) The Zebrafish Book, (Eugene: The 
University of Oregon Press)) and processed as described above for whole-mount in situ 
5 hybridizations with the axial probe. 

Transgenic Drosophila 

An EcoRl fragment, containing the entire Shh ORF, was purified from the plasmid 
p/zA[c]8.3 and ligated with phosphatased EcoRl digested transformation vector pCaSpeRhs 
(Thummel, C.S. et aL, (1988) Gene 74: 445-456). The recombinant plasmid, pHS Shh 
10 containing the Shh ORF in the correct orientation relative to the heat shock promoter, was 
selected following restriction enzyme analysis of miniprep DNA from transformed colonies 
and used to transform Drosophila embryos using standard microinjection procedures 
(Roberts, D.B. (1986), Drosophila, A Practical Approach, Roberts, D.B., ed., (Oxford: IRL 
£3 Press) pp. 1-38). 

£ J 15 Ectopic Expression In Drosophila Embryos 

|p Embryos carrying the appropriate transgenes were collected over 2 hr intervals, 

P transferred to thin layers of 1% agarose on glass microscope slides and incubated in a plastic 

^ Petri dish floating in a water bath at 37°C for 30 min intervals. Following heat treatment, 

$ 

p embryos were returned to 25°C prior to being fixed for in situ hybridization with DIG 

^ 20 labelled single stranded Shh, wg or ptc RNA probes as previously described (Ingham et al., 

jy (1991) Curr. Opin. Genet Dev. 1 : 261-267). 

i 

(ii) Molecular Cloning Of Zebrafish Hedgehog Homologues 

In an initial attempt to isolate sequences homologous to Drosophila hh, a zebrafish 
genomic DNA library was screened at reduced stringency with a partial cDNA, MPCR4.1, 

25 corresponding to the first and second exons of the Drosophila gene (Mohler, J. et al., (1992) 
Development 115: 957-971). This screen proved unsuccessful; however, a similar screen of a 
mouse genomic library yielded a single clone with significant homology to hh. 9 subsequently 
designated Ihh. A 264bp BamHI-EcoRI fragment from this lambda clone containing 
sequences homologous to the second exon of the Drosophila gene was subcloned and, 

30 together with the Drosophila partial cDNA fragment, used to screen a A,gtl 1 zebrafish cDNA 
library that was prepared from RNA extracted from 33h old embryos. This screen yielded 
four clones with overlapping inserts the longest of which is 1.6kb in length, herein referred to 
as Shh (SEQIDNo:5). 



35 



(Hi) A Family Of Zebrafish Genes Homologous To The Drosophila Segment Polarity Gene, 
Hedgehog 
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Alignment of the predicted amino acid sequences of Shh (SEQ ID No: 12) and hh 
(SEQ ID No:34) revealed an identity of 47%, confirming that Shh is a homolog of the 
Drosophila gene. A striking conservation occurs within exon 2: an 80 amino acid long 
domain shows 72% identity between Shh and Drosophila hh. (Figure 9A). This domain is 
5 also highly conserved in all fc/t-related genes cloned so far and is therefore likely to be 
essential to the function of hh proteins. A second domain of approximately 30 amino acids 
close to the carboxy-terminal end, though it shows only 61% amino-acid identity, possesses 
83% similarity between Shh and hh when allowing for conservative substitutions and could 
also, therefore, be of functional importance (Figure 9B). Although putative sites of post- 
10 translational modification can be noted, their position is not conserved between Shh and hh, 

Lee, J J. et ah, (1992) Cell 71: 33-50, identified a hydrophobic stretch of 21 amino 
acids flanked downstream by a putative site of signal sequence cleavage (predicted by the 
algorithm of von Heijne, G. (1986) Nuc. Acids Res. 11) close to the ammo-terminal end of 
p hh Both the hydrophobic stretch and the putative signal sequence cleavage sites of hh> 

(0 15 which suggest it to be a signaling molecule, are conserved in Shh In contrast to hh, Shh does 
not extend N-terminally to the hydrophobic stretch. 



Using degenerate oligonucleotides corresponding to amino-acids flanking the domain 



M 
VI 

p of high homology between Drosophila hh and mouse Ihh exons 2 described above, fragments 

tf* of the expected size were amplified from zebrafish genomic DNA by PCR. After subcloning 

p 20 . and sequencing, it appeared that three different sequences were amplified, all of which show 

NI high homology to one another and to Drosophila hh (Figure 10). One of these corresponds to 

P Shh therein referred to as 2-hh(a) (SEQ ID No:16) and 2hh(b) (SEQ ID No:17), while the 
other two represent additional zebrafish hh homologs (SEQ ID No:5). cDNAs corresponding 

^ to one of these additional homologs have recently been isolated, confirming that it is 

25 transcribed. Therefore, Shh represents a member of a new vertebrate gene family. 

(iv) Shh Expression In The Developing Zebrafish Embryo 

Gastrula stages 

Shh expression is first detected at around the 60% epiboly stage of embryogenesis in 
the dorsal mesoderm. Transcript is present in a triangular shaped area, corresponding to the 

30 embryonic shield, the equivalent of the amphibian organizer, and is restricted to the inner cell 
layer, the hypoblast. During gastrulation, presumptive mesodermal cells involute to form the 
hypoblast, and converge towards the future axis of the embryo, reaching the animal pole at 
approximately 70% epiboly. At this stage, Shh -expressing cells extend over the posterior 
third of the axis, and the signal intensity is not entirely homogeneous, appearing stronger at 

35 the base than at the apex of the elongating triangle of cells. 
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This early spatial distribution of Shh transcript is reminiscent of that previously 
described for axial, a forkhead-ttlated gene; however, at 80% epiboly, axial expression 
extends further towards the animal pole of the embryo and we do not see Shh expression in 
the head area at these early developmental stages. 

5 By 100% epiboly, at 9.5 hours of development, the posterior tip of the Shh expression 

domain now constitutes a continuous band of cells that extends into the head. To determine 
the precise anterior boundary of Shh expression, embryos were simultaneously hybridized 
with probes of Shh and pax-2 (previously pax[b]\ the early expression domain of which 
marks the posterior midbrain (Krauss, S. et al. (1991) Development 1 13: 1 193-1206). By this 
10 stage, the anterior boundary of the Shh expression domain is positioned in the centre of the 
animal pole and coincides approximately with that of axial At the same stage, prechordal 
plate cells expressing the homeobox gene goosecoid (gsc) overlap and underlay the 
presumptive forebrain (Statchel, S.E. et al., (1993) Development 117: 1261-1274). Whereas 
p axial is also thought to be expressed in head mesodermal tissue at this stage, we cannot be 

$Q 15 certain whether Shh is expressed in the same cells. Sections of stained embryos suggest that 
J] in the head Shh may by this stage be expressed exclusively in neuroectodermal tissue. 

(v) Somitogenesis 

o 

By the onset of somitogenesis (approximately 10.5h of development), Shh expression 
* in the head is clearly restricted to the ventral floor of the brain, extending from the tip of the 

20 diencephalon caudally through the hindbrain. At this stage, expression of axial has also 
O disappeared from the head mesoderm and is similarly restricted to the floor of the brain; in 

!Jf contrast to Shh, however, it extends only as far as the anterior boundary of the midbrain. At 

p this point, gsc expression has become very weak and is restricted to a ring of cells that appear 

to be migrating away from the dorsal midline. 

25 As somitogenesis continues, Shh expression extends in a rostral-caudal progression 

throughout the ventral region of the central nervous system (CNS). Along the spinal cord, 
the expression domain is restricted to a single row of cells, the floor plate, but gradually 
broadens in the hindbrain and midbrain to become 5-7 cells in diameter, with a triangular 
shaped lateral extension in the ventral diencephalon and two strongly staining bulges at the 

30 tip of the forebrain, presumably in a region fated to become hypothalamus. 

As induction of Shh in the floor plate occurs, expression in the underlying mesoderm 
begins to fade away, in a similar maimer to axial (Strahle, U. et al., (1993) Genes & Dev. 7: 
1436-1446). This downregulation also proceeds in a rostral to caudal sequence, coinciding 
with the changes in cell shape that accompany notochord differentiation. By the 22 somite 
35 stage, mesodermal Shh expression is restricted to the caudal region of the notochord and in 
the expanding tail bud where a bulge of undifferentiated cells continue to express Shh at 
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relatively high levels. Expression in the midbrain broadens to a rhombic shaped area; 
cellular rearrangements that lead to the 90° kink of forebrain structures, position 
hypothalamic tissue underneath the ventral midbrain. These posterior hypothalamic tissues 
do not express Shh. In addition to Shh expression in the ventral midbrain, a narrow stripe of 
5 expressing cells extends dorsally on either side of the third ventricle from the rostral end of 
i the Shh domain in the ventral midbrain to the anterior end of, but not including, the 

epiphysis. The most rostral Shh expressing cells are confined to the hypothalamus. In the 
telencephalon, additional Shh expression is initiated in two 1-2 cell wide stripes. 

By 36 hours of development, Shh expression in the ventral CNS has undergone 
10 further changes. While expression persists in the floor plate of the tailbud, more rostrally 
located floor plate cells in the spinal cord cease to express the gene. In contrast, in the 
hindbrain and forebrain Shh expression persists and is further modified. 

At 26-28h, Shh expression appears in the pectoral fin primordia, that are visible as 
P placode like broadenings of cells underneath the epithelial cell layer that covers the yolk. By 

gi 15 33 hrs of development high levels of transcript are present in the posterior margin of the 
Nj pectoral buds; at the same time, expression is initiated in a narrow stripe at the posterior of 

jjj the first gill. Expression continues in the pectoral fin buds in lateral cells in the early larva. 

O At this stage, Shh transcripts are also detectable in cells adjacent to the lumen of the foregut. 

P 

^ 20 Two mutations affecting the differentiation of the Axial tissues that express Shh have 

C3 

|y been described in zebrafish embryos homozygous for the cyclops (eye) mutation lack a 

gg differentiated flooiplate (Hatta, K. et ah, (1991) Nature 350: 339-341). By contrast, 

V 5 homozygous notail (ntl) embryos are characterized by a failure in notochord maturation and a 

disruption of normal development of tail structures (Halpern, M.E. et aL, (1993) Cell 75: 99- 
25 111). 

A change in Shh expression is apparent in eye embryos as early as the end of 
gastrulation; at this stage, the anterior limit of expression coincides precisely with the two 
pax-2 stripes in the posterior midbrain. Thus, in contrast to wild-type embryos, no Shh 
expression is detected in midline structures of the midbrain and forebrain. By the 5 somite 
30 stage, Shh transcripts are present in the notochord which at this stage extends until 
rhombomere 4; however, no expression is detected in more anterior structures. Furthermore, 
no Shh expression is detected in the ventral neural keel, in particular in the ventral portions of 
the midbrain and forebrain. 



(vi) Expression Of Shh In Cyclops And Notail Mutants 



35 



At 24 hours of development, the morphologically visible eye phenotype consists of a 
fusion of the eyes at the midline due to the complete absence of the ventral diencephalon. As 
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at earlier developmental stages, Shh expression is absent from neural tissue. Shh expression 
in the extending tail bud of wild-type embryos is seen as a single row of floor plate cells 
throughout the spinal cord. In a eye mutant, no such Shh induction occurs in cells of the 
ventral spinal cord with the exception of some scattered cells that show transient expression 
near the tail. Similarly, no Shh expression is seen rostrally in the ventral neural tube. 
However, a small group of Shh expressing cells is detected underneath the epiphysis which 
presumably correspond to the dorsal-most group of Shh expressing cells in the diencephalon 
of wild-type embryos. 

In homozygous notail (ntl) embryos, no Shh staining is seen in mesodermal tissue at 
24 hours of development, consistent with the lack of a notochord in these embryos; by 
contrast, expression throughout the ventral CNS is unaffected. At the tail bud stage, 
however, just prior to the onset of somitogenesis, Shh expression is clearly detectable in 
notochord precursor cells. 

(vii) Injection Of Synthetic Shh Transcripts Into Zebrafish Embryos Induces Expression Of A 
Floor Plate Marker 

To investigate the activity of Shh in the developing embryo, an over-expression 
strategy, similar to that employed in the analysis of gene function in Xenopus, was adopted. 
Newly fertilized zebrafish eggs were injected with synthetic Shh RNA and were fixed 14 or 
24 hours later. As an assay for possible changes in cell fate consequent upon the ectopic 
activity of Shh* we decided to analyze Axial expression, since this gene serves as a marker for 
cells in which Shh is normally expressed. A dramatic, though highly localized ectopic 
expression of Axial in a significant proportion (21/80) of the injected embryos fixed after 24 
hours of development is observed. Affected embryos show a broadening of the Axial 
expression domain in the diencephalon and ectopic Axial expression in the midbrain; 
however, in no case has ectopic expression in the telencephalon or spinal cord been observed. 
Many of the injected embryos also showed disturbed forebrain structures, in particular 
smaller ventricles and poorly developed eyes. Amongst embryos fixed after 14h, a similar 
proportion (8/42) exhibit the same broadening and dorsal extension of the Axial stripe in the 
diencephalon as well as a dorsal extension of Axial staining in the midbrain; again, no 
changes in Axial expression were observed caudal to the hindbrain with the exception of an 
increased number of expressing cells at the tip of the tail. 

(viii) Overexpression Of Shh In Drosophila Embryos Activates The hh-Dependent Pathway 

In order to discover whether the high degree of structural homology between the 
Drosophila and zebrafish hh genes also extends to the functional level, an overexpression 
system was used to test the activity of Shh in flies. Expression of Drosophila hh driven by 
the HSP70 promoter results in the ectopic activation of both the normal targets of hh activity; 
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the wg transcriptional domain expands to fill between one third to one half of each 
parasegment whereas ptc is ectopically activated in all cells except those expressing en 
(Ingham, P.W. (1993) Nature 366:560-562). To compare the activities of the fly and fish 
genes, flies transgenic for a HS Shh construct were generated described above and subjected 
5 to the same heat shock regime as H Shh transgenic flies. HS Shh embryos fixed immediately 
after the second of two 30 min heat shocks exhibit ubiquitous transcription of the Shh cDNA. 
Similarly treated embryos were fixed 30 or 90 min after the second heat shock and assayed 
for wg or ptc transcription. Both genes were found to be ectopically activated in a similar 
manner to that seen in heat shocked H Shh embryos; thus, the zebrafish Shh gene can activate 
10 the same pathway as the endogenous hh gene. 



4= 
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Example 5 

p Cloning Expression and Localization of Human Hedgehogs 

gj (i) Experimental Procedures 

15 Isolation of human hedgehog cDNA clones. 

Degenerate nucleotides used to clone chick Shh (Riddle et aL, (1993) Cell 75:1401- 
1416) were used to amplify by nested PGR human genomic DNA. The nucleotide sequence 
*B of these oligos is as follows: 

9 

o 

N 20 vHH50:5 , -GGAATTCCCAG(CA)GITG(CT)AA(AG)GA(AG)(CA)(AG)I(GCT)TIAA-3 , 
|j (SEQIDNO:18); 

*0 vHH30:5 , -TCATCGATGGACCCA(GA)TC(GA)AAICCIGC(TC)TC-3 , (SEQ ID NO:19); 

vHH3I:5 , -GCTCTAGAGCTCIACIGCIA(GA)IC(GT)IGGIA-3 , (SEQ ID NO:20) 



25 The expected 220 bp PCR product was subcloned into pGEM7zf (Promega) and 

sequenced using Sequenase v2.0 (U.S. Biochemicals). One clone showed high nucleotide 
similarity to mouse Ihh and mouse Shh sequence (Echelard et al y (1993) Cell 75:1417-1430) 
and it was used for screening a human fetal lung 5 r -stretch plus cDNA library (Clontech) in X 
gtlO phage. The library was screened following the protocol suggested by the company and 

30 two positive plaques were identified, purified, subcloned into pBluescript SK-+- (Stratagene) 
and sequenced, identifying them as the human homologues of Shh (SEQ ID NO:6) and Ihh 
(SEQ ID NO:7). 

One clone contained the full coding sequence of a human homolog of Shh as well as 
150 bp of 5* and 36 bp of 3' untranslated sequence. The other clone, which is the human 
35 homolog of Ihh, extends from 330 bp 3' of the coding sequence to a point close to the 
predicted boundary between the first and second exon. The identity of these clones was 
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10 



determined by comparison to the murine and chick genes. The protein encoded by human 
Shh has 92.4% overall identity to the mouse Shh 9 including 99% identity in the amino- 
terminal half. The carboxyl-terminal half is also highly conserved, although it contains short 
stretches of 16 and 11 amino acids not present in the mouse Shh. The human Ihh protein is 
96.8% identical to the mouse Ihh. The two predicted human proteins are also highly related, 
particularly in their ammo-terminal halves where they are 91.4% identical. They diverge 
significantly in their carboxyl halves, where they show only 45.1% identity. The high level 
of similarity in the amino portion of all of these proteins implies that this region encodes 
domains essential to the activity of this class of signaling molecules. 



Northern blotting 

Multiple Tissue Northern Blot (Clontech) prepared from poly A+RNA isolated from 
human adult tissues was hybridized with either full length 32p_i a beled human Shh clone or 
T 32p_i a beled human Ihh clone following the protocol suggested by the company. 

0 15 

p Digoxigenin in situ hybridization. 

ST Sections: tissues from normal human second trimester gestation abortus specimens 

jj, were washed in PBS and fixed overnight at 4°C paraformaldehyde in PBS, equilibrated 24 

p hours at 4°C in 50% sucrose in PBS and then placed in 50% sucrose in oct for one hour 

^ 20 before embedding in oct. Cryostat sections (10-25 mm) were collected on superfrost plus 
p slides (Fisher) and frozen at -80°C until used. Following a postfixation in 4% 

N paraformaldehyde the slides were processed as in Riddle et al., (1993) Cell 75:1401-1416 

!jj with the following alterations: proteinase K digestion was performed at room temperature 

from 1-15 minutes (depending on section thickness), prehybridization, hybridization and 
^ 25 washes time was decreased to 1/1 0 of time. 

Whole-mounts: tissues from normal second trimester human abortus specimens were 

washed in PBS, fixed overnight at 4°C in 4% paraformaldehyde in PBS and then processed 

as in Riddle et al., (1993) Cell 75:1401-1416. 

30 Isolation of an Shh PI clone. 

The human Shh gene was isolated on a PI clone from a PI library (Pierce and 
Sternberg, 1992) by PCR (polymerase chain reaction) screening. Two oligonucleotide 
primers were derived from the human Shh sequence. The two olignucleotide primers used for 
PCR were: 

35 SHHF5'-ACCGAGGGCTGGGACGAAGATGGC-3' (SEQ ID NO:43) 
SHR5'-CGCTCGGTCGTACGGCATGAACGAC-3 r (SEQ ID NO:44) 
The PCR reaction was carried using standard conditions as described previously (Thierfelder 
et al., 1994) except that the annealing temperature was 65 °C. These primers amplified a 1 19 
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bp fragment from human and PI clone DNA. The PI clone was designated SHHP1. After 
the PI clone was isolated these oligonucleotides were used as sequencing primers. A 
2.5Kb£coRI fragment that encoded a CA repeat was subcloned from this PI clone using 
methods described previously (Thierfelder et al. 1994). Oligonucleotide primers that 
amplified this CA repeat sequence were fashioned from the flanking sequences: 
SHHCAFS'-ATGGGGATGTGTGTGGTCAAGTGTA-S 1 (SEQ ID NO:45) 
SHHCAR5'-TTCACAGACTCTCAAAGTGTATTTT-3 r (SEQ ID NO:46) 

Mapping the human Ihh and Shh genes. 

The human Ihh gene was mapped to chromosome 2 using somatic cell hybrids from 
NIGMS mapping pannel 2 (GM10826B). 

The Shh gene was mapped to chromosome 7 using somatic cell hybrids from NIGMS 
mapping panel 2 (GM10791 and GM10868). 

Linkage between the limb deformity locus on chromosome 7 and the Shh gene was 
demonstrated using standard procedures. Family LD has been described previously 
(Tkukurov et al, (1994) Nature Genet. 6:282-286). A CA repeat bearing sequence near the 
Shh gene was amplified from the DNA of all members of Family LD by PCR using the 
SHHCAF and SHHCAR primers. Linkage between the CA repeat and the LD disease gene 
segregating in Family LD was estimated by the MLINK program (Oct, 1967). Penetrance 
was set at 100% and the allele frequencies were determined using unrelated spouses in the 
LD family. 

Interspecific Backcross Mapping. 

Interspecific backcross progeny were generated by mating (C57BL/6J x M. spretus) 
Fl females and C57BL/6J males as described (Copeland and Jenkins, (1991) Trends Genet. 
7:113-118). A total of 205 N2 mice were used to map the Ihh and Dhh loci. DNA isolation, 
restriction enzyme digestions, agarose gel electrophoresis, Southern blot transfer and 
hybridization were performed essentially as described (Jenkins et aL, (1982) J. Virol 43:26- 
36). All blots were prepared with Hybond-N+ nylon membrane (Amersham). The probe, an 
~ 1.8kb EcdSl fragment of mouse cDNA, detected a major fragment of 8.5 kb in C57BL/6j 
(B) DNA and a major fragment 6.0 kb in M. spretus (S) DNA following digestion with BgRl. 
The Shh probe, an ~ 900 bp Smal fragment of mouse cDNA, detected Hindi fragments of 7.5 
and 2.1 kb (B) as well as 4.6 and 2.1 (S). The Dhh probe, and ~ 800 bp BamHi/EcoRi 
fragment of mouse genomic DNA, detected major fragments of 4.7 and 1.3 kb (B) and 8.2 
and 1.3 kb (S) following digestion with Sphl. The presence or absence of M spretus specific 
fragments was followed in backcross mice. 

A description of the probes and RFLPs for loci used to position the Ihh, Shh and Dhh 
loci in the interspecific backcross has been reported. These include: Fnl, Vil and Acrg 7 
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chromosome 1 (Wilkie et al, (1993) Genomics 18:175-184), Gnail, En2 9 116, chromosomes 
5 (Miao et al, (1994) PNAS USA 91:11050-1 1054) and Pdgfb, Gdcl and Rarg, chromosome 
15 (Brannan et al, (1992) Genomics 13:1075-1081). Recombination distances were 
calculated as described (Green, (1981) Linkage, recombination and mapping. In ''Genetics 
5 and Probability in Animal Breeding Experiments", pp. 77-1 13, Oxford University Press, NY) 
using the computer program SPRETUS MADNESS. Gene order was determined by 
minimizing the number of recombination events required to explain the allele distribution 
patterns. 



10 (ii) Expression of Human Shh and Ikh 

To investigate the tissue distribution of Shh and Ihh expression, poly(A)+RNA 
samples from various adult human tissues were probed with the two cDNA clones. Of the 
tissues tested, an Tfc/i-specific message of -2.7 kb is only detected in liver and kidney. Shh 
transcripts was not detected in the RNA from any of the adult tissues tested. All the samples 
Q 15 contained approximately equal amounts of intact RNA, as determined by hybridization with a 
control probe. 

N The hedgehog family of genes were identified as mediators of embryonic patterning 

4= 

% ' n in flies and vertebrates. No adult expression of these genes had previously been reported. 

O These results indicate that Ihh additionally plays a role in adult liver and kidney. Since the 

^ 20 hedgehog genes encode intercellular signals, Ihh may function in coordinating the properties 

ft 

P of different cell types in these organs. Shh may also be used as a signaling molecule in the 

adult, either in tissues not looked at here, or at levels too low to be detected under these 

Q 

p- k conditions. 

10 In situ hybridization was used to investigate the expression of Shh in various mid- 

^' 25 gestational human fetal organs. Shh expression is present predominantly in endoderm 
derived tissues: the respiratory epithelium, collecting ducts of the kidney, transitional 
epithelium of the ureter, hepatocytes, and small intestine epithelium. Shh was not detectable 
in fetal heart or placental tissues. The intensity of expression is increased in primitive 
differentiating tissues (renal blastema, base villi, branching lung buds) and decreased or 
30 absent in differentiated tissues (e.g. glomeruli). Shh expression is present in the mesenchyme 
immediately abutting the budding respiratory tubes. The non-uniform pattern of Shh 
expression in hepatocytes is consistent with expression of other genes in adult liver 
(Dingemanse et al, (1994) Differentiation 56:153-162). The base of villi, the renal blastema, 
and the lung buds are all regions expressing Shh and they are areas of active growth and 
35 differentiation, suggesting Shh is important in these processes. 



(Hi) The Chromosomal Map Location of Human Shh and Ihh. 
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Since Shh is known to mediate patterning during the development of the mouse and 
chick and the expression of Shh and Ihh are suggestive of a similar role in humans, fiiutations 
in these genes would be expected to lead to embryonic lethality or congenital defects. One 
way of investigating this possibility is to see whether they are genetically linked to any 
5 known inherited disorders. 

Shh- and TWt-specific primers were designed from their respective sequences and were 
used in PCR reactions on a panel of rodent-human somatic cell hybrids. Control rodent DNA 
did not amplify specific bands using these primers. In contrast, DNA from several rodent- 
human hybrids resulted in PCR products of the appropriate size allowing us to assign Shh to 
1 0 chromosome 7q and Ihh to chromosome 2. 

One of the central roles of chick Shh is in regulating the anterior-posterior axis of the 
limb. A human congenital polysyndactyly has recently been mapped to chromosome 7q36 
(Tsukurov et al 9 (1994) Nature Genet 6:282-286; Heutink et al 9 (1994) Nature Genet. 
6:287-291). The phenotype of this disease is consistent with defects that might be expected 
O 15 from aberrant expression of Shh in the limb. Therefore, the chromosomal location of Shh 

was mapped more precisely, in particular in relation to the polysyndactyly locus. 
N API phage library was screened using the Shh specific primers for PCR amplification 

Jj and clone SHHP1 was isolated. Clone SHHP1 contained Shh sequence. A Southern blot of 

Q an EcoRi digest of this phage using [CA]/[GT] probe demonstrated that a 2.5 Kb EcoRi 

0 20 fragment contained a CA repeat. Nucleotide sequence analysis of this subcloned EcoRi 
q . fragment demonstrated that the CA repeat lay near the EcoRi sites. Primers flanking the CA 
repeat were designed and used to map the location of Shh relative to other markers on 7q in 
individuals of a large kindred with complex polysyndactyly (Tsukurov et aL 9 (1994) Nature 
)S Genet 6:282-286). Shh maps close to D75550 on 7q36, with no recombination events seen 

^ 25 m this study. It is also extremely close to, but distinct from, the polysyndactyly locus with 
one recombination event observed between them (maximum lod score = 4.82, 0 = 0.05). 
One unaffected individual (pedigree ID V-10 in Tsukurov et al 9 (1994) Nature Genet 6:282- 
286) has the Shh linked CA repeat allele found in all affected family members. No 
recombination was observed between the locus En2 and the Shh gene (maximum lod score = 
30 1.82,0 = 0.0). 

(iv) Chromosomal mapping of the Murine Ihh, Shh and Dhh genes. 

The murine chromosomal location of Ihh, Shh and Dhh was determined using an 
interspecific backcross mapping panel derived from crosses of [(C57BL/6J x M. spetrus)Fl X 
35 C57BL/J)] mice. cDNA fragments from each locus were used as probes in Southern blot 
hybridization analysis of C57BL/6J and M. spretus genomic DNA that was separately 
digested with several different restriction enzymes to identify informative restriction 
fragment length polymorphisms (RFLPs) useful for gene mapping. The strain distribution 
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pattern of each RFLP in the interspecific backcross was then determined by following the 
presence or absence of RFLPs specific for M spretus in backcross mice. 

Ihh mapped to the central region of mouse chromosome 1, 2.7 cM distal of Fnl and 
did not recombine with Vil in 190 animals typed in common, suggesting that the two loci are 
5 within 1 .6 cM (upper 95% confidence level) (Fig. 16). Shh mapped to the proximal region of 
mouse chromosome 5, 0.6 cM distal of En2 and 1.9 cM proximal of 116 in accordance to 
Chang et al y (1994) Development 120:3339-3353. Dhh mapped to the very distal region of 
mouse chromosome 15, 0.6 cM distal of Gdcl and did not recombine with Rarg in 160 
animals typed in common, suggesting that the two loci are within 1 .9 cM of each other (upper 
1 0 95% confidence level) (Fig. 1 6). 

Interspecific maps of chromosome 1, 5 and 15 were compared with composite mouse 
linkage maps that report the map location of many uncloned mouse mutations (compiled by 
M.T. Davisson, T.H. Roderick, A.L. Hillyard and D.P. Doolittle and provided from GBASE, 
a computerized database maintained at The Jackson Laboratory, Bar Harbor, ME). The 
p 15 hemimelic extra-toe (Hx) mouse mutant maps LI cM distal to En2 on chromosome 5 (Martin 
P et aL, (1990) Genomics 6:302-308), a location in close proximity to where Shh has been 

positioned . Hx is a dominant mutation which results in preaxial Polydactyly and hemimelia 
affecting all four limbs (Dickie, (1968) Mouse News Lett 38:24; Knudsen and Kochhar, 
p (1981) J. Embryol Exp, Morph 65: Suppl. 289-307). Shh has previously been shown to be 

^0 20 expressed in the limb (Echelard et al> (1993) Cell 75:1417-1430). To determine whether Shh 
q and Hx are tightly linked we followed their distribution in a backcross panel in which Hx was 

M segregating. Two recombinants between Shh and Hx were identified, thus excluding the 

|y possibility that the two loci are allelic and these observations are again consistent with those 

of Chang et al , (1994) Development 120:3339-3353 . While there are several other mutations 
9* 25 in the vicinity of Ihh and Dhh 9 none is an obvious candidate for an alteration in the 
corresponding gene. 

The central region of mouse chromosome 1 shares homology with human 
chromosome 2q (summarized in Fig. 16). Placement of Ihh in this interval suggests the 
human homolog of Ihh will reside on 2q, as well. Similarly, it is likely that human homolog 
30 of Dhh will reside on human chromosome 12q. 



Example 6 

Proteolytic Processing Yields Two Secreted Forms of Sonic Hedgehog 
(i) Experimental Procedures 
35 In vitro Translation and Processing 
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Mouse and chick sonic hedgehog coding sequences were inserted into the vector 
pSP64T (kindly provided by D. Melton) which contains an SP6 phage promoter and both 5* 
and 3' untranslated sequences derived from the Xenopus laevis (5-Globin gene. After 
restriction endonuclease digestion with Sal I to generate linear templates, RNA was 
5 transcribed in vitro using SP6 RNA polymerase (Promega, Inc.) in the presence of 1 mM cap 
structure analog (m 7 G(5 , )ppp(5')Gm; Boehringer-Mannheim, Inc.) Following digestion with 
RQ1 DNase I (Promega, Inc.) to remove the DNA template, transcripts were purified by 
phenolxholoroform extraction and ethanol precipitation. 

Rabbit reticulocyte lysate (Promega, Inc.) was used according to the manufacturer's 
10 instructions. For each reaction, 12.5 \i\ of lysate was programmed with 0.5-2.0 (ig of in vitro 
transcribed RNA. The reactions contained 20 \id of Express labeling mix (NEN/DuPont, 
Inc.) were included. To address processing and secretion in vitro, 1.0-2.0 pi of canine 
pancreatic microsomal membranes (Promega, Inc.) were included in the reactions. The final 
reaction volume of 25 ^il was incubated for one hour at 30°C. Aliquots of each reaction 
15 (between 0.25 and 3.0 were boiled for 3 minutes in Laemmli sample buffer (LSB: 125 
mM Tris-Hcl [pH 6.8]; 2% SDS; 1% 2-mercaptoethanol; 0.25 mg/ml bromophenol blue) 
before separating on a 15% polyacrylamide gel. Fixed gels were processed for fluorography 
using EnHance (NEN/DuPont, Inc.) as described by the manufacturer. 

Glycosylation was addressed by incubation with Endoglycosidase H (Endo H; New 
20 England Biolabs, Inc.) according to the manufacturer's directions. Reactions were carried out 
for 1-2 hr at 37°C before analyzing reaction products by polyacrylamide gel electrophoresis 
(PAGE). 

Xenopus Oocyte Injection and Labeling 

Oocytes were enzymatically defolliculated and rinsed with OR2 (50 mM HEPES [pH 
25 7.2], 82 mM NaCI, 2.5 mM KC1, 1.5 mM Na2HP04). Healthy stage six oocytes were 
injected with 30 ng of in vitro transcribed, capped mouse Shh RNA (prepared as described 
above). Following a 2 hr recovery period, healthy injected oocytes and uninjected controls 
were cultured at room temperature in groups of ten in 96-well dishes containing 0.2 ml of 
OR2 (supplemented with 0.1 mg/ml Gentamicin and 0.4 mg/ml BSA) per well. The 
30 incubation medium was supplemented with 50 jiCi of Express labeling mix. Three days after 
injection, the culture media were collected and expression of Shh protein analyzed by 
immunoprecipitation. Oocytes were rinsed several times in OR2 before lysing in TENT (20 
mM Tris-HCl [pH 8.0]; 150 mM NaCI, 2rnM EDTA; 1% Triton-X-100; 10 ^il/oocyte) 
supplemented with 1 |ig/ml aprotinin, 2 jig/ml leupeptin and ImM 
35 phenylmethylsufonylfluoride (PMSF). After centrifugation at 13000 x g for 10 minutes at 
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4°C, soluble protein supernatants were recovered and analyzed by immunoprecipitation (see 
below). 



Cos Cell Transection and Labeling 

5 Cos cells were cultured in Dulbecco's Modified Eagle Medium (DMEM; Sigma, Inc.) 

supplemented with 10% fetal bovine serum (Gibco/BRL), 2 mM L-Glutamine (Gibco/BRL) 
and 50 mU/ml penicillin and 50 (ig/ml streptomycin (Gibco/BRL). Subconfluent cos cells in 
35 mm or 60 mm dishes (Falcon, Inc.) were transiently transfected with 2 mg or 6 mg 
supercoiled plasmid DNA, respectively. Between 42 and 44 hr post-transfection, cells were 
10 labeled for 4-6 hr in 0.5 ml (35 mm dishes) or 1.5 ml (60 mm dishes) serum-free DMEM 
lacking Cysteine and Methionine (Gibco/BRL) and supplemented with 125 nCi/ml each of 
Express labeling mix and L-35S-Cysteine (NEN/DuPont). After labeling, media were 
collected and used for immunoprecipitation. Cells were rinsed with cold PBS and lysed in 
$g the tissue culture dishes by the addition of 0.5 ml (35 mm dishes) or 1.5 ml (60 mm dishes) 
P 1 5 TENT (with protease inhibitors as described above) and gentle rocking for 30 minutes at 4°C. 
_g Lysates were cleared by centrifiigation (13000 x g for 5 min. at 4°C) and the supernatants 

were analyzed by immunoprecipitation (see below). 

P 

*D Baculovirus Production and Infection 

P A recombinant baculovirus expressing mouse sonic hedgehog with a myc epitope tag 

p 20 inserted at the carboxy terminus was generated using the Baculogold kit (Pharmingen, Inc.). 
|U The initial virus production used Sf 9 cells, followed by two rounds of amplification in High 

p Five cells (Invitrogen, Inc.) in serum-free medium (ExCell 401; Invitrogen, Inc.). A 

baculovirus lacking Shh coding sequences was also constructed as a control. For protein 
induction, High Five cells were infected at a multiplicity of approximately 15. Three days 
25 later, medium and cells were collected by gentle pipetting. Cells were collected by 
centrifiigation (1000 x g) and the medium was recovered for Western blot analysis. Cell 
pellets were washed twice in cold PBS and lysed in TENT plus protease inhibitors (see 
above) by rotating for 30 minutes at 4°C in a microcentrifuge tube. The lysate was cleared as 
described above prior to Western blotting. 

30 Western Blotting 

For Western blotting, 0.25 ml samples of media (1% of the total) were precipitated 
with TCA and redissolved in 15 ^il of LSB. Cell lysate samples (1% of total) were brought to 
a final volume of 15 \x\ with water and concentrated (5X) LSB . Samples were boiled S 
minutes prior to separation on a 15% aciylamide gel. Proteins were transferred to PVDF 
35 membrane (Immobilon-P; Millipore, Inc.) and blocked in BLOTTO (5% w/v non-fat dried 
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milk in PBS) containing 0.2% Tween-20. Hybridoma supernatant recognizing the humanc- 
myc epitope (9E10; Evan, G.I. et al., (1985) Mol Cell Biol 5:3610-3616) was added at a 
dilution of 1:200 for one hour followed by a 1:5000 dilution of Goat anti-Mouse-Alkaline 
phosphatase conjugate (Promega, Inc.) for 30 minutes. Bands were visualized using the 
Lumi-Phos 530 reagent (Boehringer-Mannheim) according to the manufacturer's directions. 

For Western blotting of COS cell material, cleared media (see above) were 
precipitated with TCA in the presence of 4^g of BSA per ml as a carrier, the protein pellets 
were dissolved in 20|al of LSB. Dissolved medium protein and cell lysates (see above) were 
boiled for 5 min, and lOjal (50%) of each medium sample and 10|il (10%) of each cell lysate 
were separated on a 15% acrylamide gel. The gel was blotted to a polyvinylidene difluoride 
membrane as described above. The membrane was blocked as described above and incubated 
in a 1:200 dilution of affinity-purified Shh antiserum (see below) and then in a 1:5,000 
dilution of horseradish peroxidase-conjugated donkey anti-rabbit immunoglobulin g (IgG; 
Jackson Immuno research, Inc.). Bands were visualized with the Enhanced 
Chemiluminescence kit (Amersham, Inc.) according to the manufacturer's instructions. 

For Western blotting of mouse and chicken embryonic tissue lysates, 60|ig of each 
sample was separated on 15% acrylamide gels. Blotting and probing with affinity-purified 
Shh antiserum as well as chemiluminescence detection were carried out as described above 
for the COS cell material. 

Immunoprecipitation 

Cell lysates (Xenopus oocytes or cos cells) were brought to 0.5 ml with TENT (plus 
protease inhibitors as above). Media samples (OR2 or DMEM) were cleared by 
centrifugation at 13000 x g for 5 min. (4°C) and 10X TENT was added to a final 
concentration of IX (final volume: 0.5-1.5 ml). The c-myc monoclonal antibody hybridoma 
supernatant was added to 1/20 of the final volume. Samples were rotated for 1 hr at 4°C, 
then 0.1 ml of 10% (v/v) protein A-Sepharose CL-4B (Pharmacia, Inc.) was added. Samples 
were rotated an additional 14-16 h. Immune complexes were washed 4 times with 1.0 ml 
TENT. Immunoprecipitated material was eluted and denatured by boiling for 10 minutes in 
25 jil IX LSB. Following centrifugation, samples were separated on 15% acrylamide gels 
and processed for fluorography as described previously. Samples for Endo H digestion were 
eluted and denatured by boiling for 10 minutes in the provided denaturation buffer followed 
by digestion with Endo H for 1-2 hr at 37°C. Concentrated (SX) LSB was added and the 
samples were processed for electrophoresis as described. 

For immunoprecipitation with the anti-mouse Shh serum, samples (Cos cell lysates 
and DMEM) were precleared by incubating 1 hr on ice with 3 fil pre-immune serum, 
followed by the addition of 0.1 ml 10% (v/v) Protein A-Sepharose. After rotating for 1 hr at 
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4 C, supernatants were recovered and incubated for 1 hr on ice with 3^1 depleted anti-mouse 
Shh serum (see below). Incubation with Protein A-Sepharose, washing, elution and 
electrophoresis were then performed as described above. 

Immunqfluorescent Staining of Cos Cells 

Twenty-four hours after transfection, cells were transferred to 8-chamber slides (Lab- 
Tek, Inc.) and allowed to attach an additional twenty-four hours. Cells were fixed in 2% 
paraformaldehyde/0.1% glutaraldehyde, washed in PBS and permeabilized in 1% Triton-X- 
100 (Munro, S. and Pelham, H.R.B., (1987) Cell 48:899-907). After washing in PBS, cells 
were treated for 10 minutes in 1 mg/ml sodium borohydride. Cells were incubated with the 
c-myc monoclonal antibody hybridoma supernatant (diluted 1:10) and the affinity purified 
mouse Sonic hedgehog antiserum (diluted 1:4) for 45 minutes followed by incubation in 
1:100 Goat-anti Mouse IgG-RITC plus 1:100 Goat anti Rabbit IgG F1TC (Southern 
Biotechnology Associates, Inc.) for 45 minutes. DAPI (Sigma, Inc.) was included at 0.3 \i 
g/ml The slides were mounted in Slo-Fade (Molecular Probes, Inc.) and photographed on a 
Leitz DMR compound microscope. 

Embryonic Tissue Dissection and Lysis 

Mouse forebrain, midbrain, hindbrain, lung, limb, stomach, and liver tissues form 
15.5-day-postcoitum Swiss Webster embryos were dissected into cold PBS, washed several 
times in PBS, and then lysed by trituraton and gentle sonication in LSB lacking bromophenol 
blue. Lysates were cleared by brief centrifugation, and protein concentrations were 
determined by the Bradford dye-binding assay. 

To obtain chicken CNS and limb bud tissue, fertilized eggs (Spafas, Inc.) were 
incubated at 37°C until the embryos reached stages 20 and 25, respectively (Hamburg and 
Hamilton (1951) J. Exp. MorphoL 88:49-92). By using sharp tungsten needles, dorsal and 
ventral pieces of the anterior CNS were obtained from the stage 15 embryos, and limb buds 
from the stage 25 embryos were cut into anterior and posterior halves. Tissues were lysed, 
and protein concentrations were determined as described above. Prior to electrophoresis of 
the mouse and chicken proteins (see above), samples were brought to 20 \i\ with LSB 
containing bromophenol blue and boiled for 5 minutes. 

Antibody Production and Purification 

A PCR fragment encoding amino acids 44-143 of mouse Sonic hedgehog was cloned 
in frame into the Eco Rl site of pGEX-2T (Pharmacia, Inc.). Transformed bacteria were 
induced with IPTG and the fusion protein purified on a Glutathione- Agarose affinity column 
(Pharmacia, Inc.) according to the manufacturer's instructions. Inoculation of New Zealand 
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White rabbits, as well as test and production bleeding were carried out at Hazelton Research 
Products, Inc. 

To deplete the serum of antibodies against Glutathione-S-transferase (GST) and 
bacterial proteins, a lysate of E. coli transformed with pGEX-2T and induced with IPTG was 
coupled to Affi-Gel 10 (Bio-Rad, Inc.) The serum was incubated in batch for two hours with 
the depletion matrix before centrifugation (1000 x g for 5 min.) and collection of the 
supernatant. To make an affinity matrix, purified bacterially expressed protein corresponding 
to the amino terminal two-thirds of mouse Sonic hedgehog was coupled to Affi-Gel 10 (Bio- 
Rad, Inc.). The depleted antiserum was first adsorbed to this matrix in batch, then transferred 
to a column. The matrix was washed with TBST (25 mM Tris-HCl [pH 7.5], 140 mM NaCl, 
5 mM KC1, 0.1% Triton-X-100), and the purified antibodies were eluted with ten bed 
volumes of 0.15 M Glycine [pH 2.5]. The solution was neutralized with one volume of 1 M 
Tris-HCl [pH 8.0], and dialyzed against 160 volumes of PBS. 

Other antibodies have been generated against hedgehog proteins and three polyclonal 
rabbit antisera obtained to hh proteins can be characterized as follows: Ab77 -reacts only with 
the carboxyl processed chick Shh peptide (27 kd); Ab79 -reacts with amino processed chick, 
mouse and human Shh peptide (19 kd). Weakly reacts with 27 kd peptide from chick and 
mouse. Also reacts with mouse Ihh; and Ab80 -reacts with only amino peptide (19kd) of 
chick, mouse and human. 

(ii) In Vitro Translated Sonic Hedgehog is Proteolyticalfy Processed and Glycosylated 

The open reading frames of chick and mouse Shh encode primary translation products 
of 425 and 437 amino acids, respectively, with predicted molecular masses of 46.4 
kilodaltons (kDa) and 47.8 kDa (Echelard, Y. et aL, (1993) Cell 75:1417-1430; Riddle, R.D. 
et aL, (1993) Cell 75:1401-1416). Further examination of the protein sequences revealed a 
short stretch of amino terminal residues (26 for chick, 24 for mouse) that are highly 
hydrophobic and are predicted to encode signal peptides. Removal of these sequences would 
generate proteins of 43.7 kDa (chick Shh) and 45.3 kDa (mouse Shh). Also, each protein 
contains a single consensus site for N-linked glycosylation (Tarentino, A.L. et aL, (1989) 
Methods Cell Biol 32:111-139) at residue 282 (chick) and 279 (mouse). These features of 
the Shh proteins are summarized in Figure 1 1 . 

A rabbit reticulocyte lysate programmed with in vitro translated messenger RNA 
encoding either chick or mouse Shh synthesizes proteins with molecular masses of 46 kDa 
and 47 kDa, respectively. These values are in good agreement with those predicted by 
examination of the amino acid sequences. To examine posttranslational modifications of Shh 
proteins, a preparation of canine pancreatic microsomal membranes was included in the 
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translation reactions. This preparation allows such processes as signal peptide cleavage and 
core glycosylation. When the Shh proteins are synthesized in the presence of these 
membranes, two products with apparent molecular masses of approximately 19 and 28 kDa 
(chick), or 19 and 30 kDa (mouse) are seen in addition to the 46 kDa and 47 kDa forms. 
5 When the material synthesized in the presence of the membranes is digested with 
Endoglycosidase H (Endo H), the mobilities of the two larger proteins are increased. The 
apparent molecular masses of the Endo H digested forms are 44 kDa and 26 kDa for chick 
Shh, and 45kDa and 27 kDa for mouse Shh. The decrease in the molecular masses of the 
largest proteins synthesized in the presence of the microsomal membranes after Endo H 
10 digestion is consistent with removal of the predicted signal peptides. The mobility shift 
following Endo H treatment indicates that N-linked glycosylation occurs, and that the 26 kDa 
(chick) and 27 kDa (mouse) proteins contain the glycosylation sites. 

The appearance of the two lower molecular weight bands (hereafter referred to as the 
P "processed forms") upon translation in the presence of microsomal membranes suggests that 

M 15 a proteolytic event in addition to signal peptide cleavage takes place. The combined 
J] molecular masses of the processed forms (19 kDa and 26 kDa for chick; 19 kDa and 27 kDa 

p for mouse) add up to approximately the predicted masses of the signal peptide cleaved 

Jjj proteins (44 kDa for chick and 45 kDa for mouse) suggesting that only a single additional 

^ cleavage occurs. 

» 

Q 20 The mouse Shh protein sequence is 12 amino acid residues longer than the chick 

^ sequence (437 versus 425 residues). Alignment of the chick and mouse Shh protein 

pi sequences reveals that these additional amino acids are near the carboxy terminus of the 

iO protein (Echelard, Y. et aL, (1993) Cell 75:1417-1430). Since the larger of the processed 

^ J forms differ in molecular mass by approximately 1 kDa between the two species, it appears 

25 that these peptides contain the carboxy terminal portions of the Shh proteins. The smaller 

processed forms, whose molecular masses are identical, presumably consist of the amino 

terminal portions. 

(Hi) Secretion of Shh Peptides 

To investigate the synthesis of Shh proteins in vivo, the mouse protein was expressed 
30 in several different eukaryotic cell types. In order to detect synthesized protein, and to 
facilitate future purification, the carboxy terminus was engineered to contain a twenty-five 
amino acid sequence containing a recognition site for the thrombin restriction protease 
followed by a ten amino acid sequence derived from the human c-myc protein and six 
consecutive histidine residues. The c-myc sequence serves as an epitope tag allowing 
35 detection by a monoclonal antibody (9E10; Evan, GJ. et aL, (1985) Mol Cell Biol 5:3610- 
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3616). The combined molecular mass of the caxboxy terminal additions is approximately 3 
kDa. 



5 Xenopus laevis oocytes 

Immunoprecipitation with the c-myc antibody detects several proteins in lysates of 
metabolically labeled Xenopus laevis oocytes injected with Shh mRNA. Cell lysates and 
medium from 35 S labeled oocytes injected with RNA encoding mouse Shh with the c-myc 
epitope tag at the at the carboxy terminus, or from control oocytes were analyzed by 
10 immunoprecipitation with c-myc monoclonal antibody. A band of approximately 47 kDa is 
seen, as is a doublet migrating near 30 kDa. Treatment with Endo H increases the mobility of 
the largest protein, and resolves the doublet into a single species of approximately 30 kDa. 
These observations parallel the behaviors seen in vitro. Allowing for the added mass of the 
carboxy terminal additions, the largest protein would correspond to the signal peptide cleaved 
4 15 form, while the doublet would represent the glycosylated and unglycosylated larger processed 
JEj form. Since the epitope tag was placed at the carboxy terminus of the protein, the identity of 

p the 30 kDa peptide as the carboxy terminal portion of Shh is confirmed. Failure to detect the 

^ 19 kDa species supports its identity as an amino terminal region of the protein. 



To test whether Shh is secreted by Xenopus oocytes, the medium in which the 
! 20 injected oocytes were incubated was probed by immunoprecipitation with the c-myc 
f antibody. A single band migrating slightly more slowly than the glycosylated larger 

\ processed form was observed. This protein is insensitive to Endo H. This result is expected 

since most secreted glycoproteins lose sensitivity to Endo H as they travel through the Golgi 
apparatus and are modified by a series of glycosidases (Kornfeld, R. and Kornfeld, S., (1985) 
25 Annu. Rev. Biochem. 54:631-664). The enzymatic maturation of the Asn-linked carbohydrate 
moiety could also explain the slight decrease in mobility of the secreted larger protein versus 
the intracellular material. Following Endo H digestion, a band with a slightly lower mobility 
than the signal peptide cleaved protein is also apparent, suggesting that some Shh protein is 
secreted without undergoing proteolytic processing. Failure to detect this protein in the 
30 medium without Endo H digestion suggests heterogeneity in the extent of carbohydrate 
modification in the Golgi preventing the material from migrating as a distinct band. 
Resolution of this material into a single band following Endo H digestion suggests that the 
carbohydrate structure does not mature completely in the Golgi apparatus. Structural 
differences between the unprocessed protein and the larger processed form could account for 
35 this observation (Kornfeld, R. and Kornfeld, S., (1985) Annu. Rev. Biochem. 54:631-664). 
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Cos cells 

The behavior of mouse Shh in a mammalian cell type was investigated using 
transfected cos cells. Synthesis and secretion of the protein was monitored by 
immunoprecipitation using the c-myc antibody. Transfected cos cells express the same Sonic 
5 hedgehog species that were detected in the injected Xenopus oocytes, and their behavior 
following Endo H digestion is also identical. Furthermore, secretion of the 30 kDa 
glycosylated form is observed in cos cells, as well as the characteristic insensitivity to Endo 
H after secretion. Most of the secreted protein co-migrates with the intracellular, 
glycosylated larger processed form, but a small amount of protein with a slightly lower 
10 mobility is also detected in the medium. As in the Xenopus oocyte cultures, some Shh which 
has not undergone proteolytic processing is evident in the medium, but only after Endo H 
digestion. 

Baculovirus infected cells 

O 

To examine the behavior of the mouse Shh protein in an invertebrate cell type, and to 

j*J 15 potentially purify Shh peptides, a recombinant baculovirus was constructed which placed the 

jg Shh coding sequence, with the carboxy terminal tag, under the control of the baculoviral 

Ul Polyhedrin gene promoter. When insect cells were infected With the recombinant 

>j baculovirus, Shh peptides could be detected in cell lysates and medium by Western blotting 

f with the c-myc antibody. 

P 

"M 20 The Shh products detected in this system were similar to those described above. 

U However, virtually no unprocessed protein was seen in cell lysates, nor was any detected in 

\D the medium after Endo H digestion. This suggests that the proteolytic processing event 

^ s , occurs more efficiently in these cells than in either of the other two cell types or the in vitro 
translation system. A doublet corresponding to the glycosylated and unglycosylated 30 kDa 
25 forms is detected, as well as the secreted, Endo I resistant peptide as seen in the other 
expression systems. Unlike the other systems, however, all of the secreted larger processed 
form appears to comigrate with the glycosylated intracellular material. 

(iv) Secretion of a Highly Conserved Amino Terminal Peptide 

To determine the behavior of the amino terminal portion of the processed Sonic 
30 hedgehog protein, the c-myc epitope tag was positioned 32 amino acids after the putative 
signal peptide cleavage site (Figure 12). Cos cells were transfected with Shh expression 
constructs containing the c-myc tag at the carboxy terminus or near the amino terminus. 
. When this construct was expressed in cos cells, both the full length protein and the smaller 
processed form (approximately 20 kDa due to addition of the c-myc tag) were detected by 
35 immunoprecipitation of extracts from labeled cells. However, the 20 kDa product is barely 
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detected in the medium. In cells transfected in parallel with the carboxy terminal c-myc 
tagged construct, the full length and 30 kDa products were both precipitated from cell lysates 
and medium as described earlier. 

As the amino terminal c-myc tag may affect the secretion efficiency of the smaller 
5 processed form, the expression of this protein was examined in cos cells using an antiserum 
directed against amino acids 44 through 143 of mouse Shh (Figure 12). After transfection 
with the carboxy-terminal c-myc tagged construct, immunoprecipitation with the anti-Shh 
serum detected a very low level of the smaller processed form in the medium despite a strong 
signal in the cell lysate. This recapitulates the results with the myc antibody. 

10 To examine the subcellular localization of Shh proteins, cos cells were transfected 

with the carboxy terminal tagged Shh construct and plated on multi-chamber slides, fixed and 
permeabilized. The cells were incubated simultaneously with the anti-Shh serum and the c- 
myc antibody followed by FITC conjugated Goat anti-Rabbit-IgG and RITC conjugated Goat 
O anti-Mouse-IgG. DAPI was included to stain nuclei. Strong perinuclear staining 
m 15 characteristic of the Golgi apparatus was observed with the anti-SAT* serum. The same 
^ subcellular region was also stained using the c-myc antibody. The coincidence of staining 
patterns seen with the two antibody preparations suggest that the low level of the smaller 
p processed form detected in the medium is not due to its retention in the endoplasmic 

reticulum, since both processed forms traffic efficiently to the Golgi apparatus, 
p 20 One explanation for the failure to detect large amounts of the smaller processed form 

^ in the culture medium could that this protein associates tightly with the cell surface or ECM. 
fc! To examine this, cells were treated with the polyanionic compounds herparin and suramin. 
\S These compounds have been shown to increase the levels of some secreted proteins in culture 
^ medium, possibly by displacing them from cell surface or ECM components or by directly 
25 binding the proteins and perhaps protecting them from proteolytic degradation (Bradley and 
Brown (1990) EMBO J. 9:1569-1575; Middaugh et al. (1992) Biochem. 31:9016-9024; 
Smolich et al. (1993) Mol Biol Cell 4:1267-1275). The 19-kDa ammo-terminal form of Shh 
is barely detectable in the medium of transfected COS cells, despite its obvious presence in 
the cell lysate. However, in the presence of 10 mg of heparin per ml, this peptide is readily 
30 detected in the medium. The addition of 10 mM suramin to the medium has an even greater 
effect. Since the concentrations used where those previously determined to elicit maximal 
responses, it is clear that suramin is more active than heparin in this assay. 

The ability of heparin and suramin to increase the amount of the smaller processed 
form in the medium of transfected cells implies that this peptide may be tightly associated 
35 with the cell surface of ECM. As a first step toward determining which region(s) of the Shh 
protein may be responsible for this retention, a truncated form of mouse Shh deleted of all 
sequence downstream of amino acid 193 was expressed in COS cells. This protein contains 
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all of the sequences encode by exons one and two, as well as five amino acids derived for 
exon three. Since its predicted molecular mass (19.2 kDa) is very close to the observed 
molecular mass of the smaller processed form, the behavior of this protein would be expected 
to mimic that of the smaller processed form. This protein is detected at a very high level in 
5 the medium, even in the absence of heparin or suramin, and migrates at a position 
indistinguishable form that of the amino-terminal cleavage product generated from the full- 
length protein. In fact, virtually no protein is seen in the cell lysates, suggesting nearly 
quantitative release of the protein into the medium. This raises the possibility that the actual 
amino terminally processed form may extend a short distance beyond amino acid 193 and 
10 that these additional amino acids contain a cell surface-ECM retention signal. 

The influence of sequences located at the extreme amino and carboxy termini of 
mouse Shh on the behavior of the protein in transfected cells was examined using the amino 
terminus-specific antiserum. Expression of a mouse Shh construct lacking a signal peptide 
results in the accumulation of approximately 28-kDa protein, as well as a small amount of 
0 15 protein which comigrates with the smaller processed form. This implies that correct cleavage 
p of Shh requires targeting of the protein to the endoplasmic reticulum, since the bulk of the 
V J processed form of Shh expressed in the cytoplasm is cleaved at a new position that is 
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approximately 9kDa carboxy terminal to the normal cleavage site. Expression of a mouse 
P Shh protein engineered to terminate after amino acid 428 (lacking nine carboxy-terminal 

fe 20 amino acids [ACt]) results in the expected amino-terminal cleavage product; however, the 

? 

p efficiency of cleavage is significantly decreased compared with that seen with the wild-type 

^ protein. Therefore, sequences located at a distance from the proteolytic processing site are 

Jy able to affect the efficiency of processing. 

25 (v) Sonic hedgehog processing in embryonic tissues 

In order to determine whether the proteolytic processing of Shh observed in the 
different expression systems reflects the behavior of the protein in embryos, the amino 
terminus-specific mouse Shh antiserum was used to probe Western blots of various chicken 
and mouse embryonic tissues. A protein with an electrophoretic mobility identical to that of 

30 COS cell-synthesized amino terminally processed form is detected at a substantial level in the 
stomach and lung tissue and at a markedly lower level in the forebrain, midbrain, and 
hindbrain tissues of 15.5-day-postcoitum mouse embryos. These tissues have all been shown 
to express Shh RNA. The 19kDa peptide is not detected in liver or late limb tissues, which 
do not express Shh RNA. Thus, the proteolytic processing of Shh observed in cell culture 

35 also occurs in embryonic mouse tissue. 

The cross-reactivity of the amino terminus-specific mouse Shh antiserum with 
chicken Shh protein allowed for examination of expression of Shh in chicken embryonic 
tissue. The antiserum detects the 19-kDa amino terminally processed form of chicken Shh in 
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txansfected COS cells, as well as in two tissues which have been shown by whole-mount in 
situ hybridization and antiserum staining to express high levels of Shh RNA and protein, i.e., 
the posterior region of the limb bud and the ventral region of the anterior CNA (Riddle et al. 
(1993) Cell 75:1401-1416). Therefore, the expected proteolytic processing of Shh occurs in 
5 chicken embryonic tissues, and diffusion of the 19-kDa protein does not extend into the 
anterior limb buds and dorsal CNS. 



m 
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(v) Hedgehog Processing 

In summary, the results discussed above demonstrate that the mouse and chick Shh 
10 genes encode secreted glycoproteins which undergo additional proteolytic processing. Data 
indicate that this processing occurs in an apparently similar fashion in a variety of cell types 
suggesting that it is a general feature of the Shh protein, and not unique to any particular 
*» expression system. For mouse Shh 9 data indicate that both products of this proteolytic 

W processing are secreted. These observations are summarized in Figure 13. 

P! 

SI 15 It was observed that the 19 kDa amino peptide accumulates to a lower level in the 

medium than the 27 kDa carboxyl peptide. This may reflect inefficient secretion or rapid 
turnover of this species once secreted. Alternatively, the smaller form may associate with the 
^ cell surface or extracellular matrix components making it difficult to detect in the medium. 

The insensitivity of the secreted, larger form to Endo H is a common feature of secreted 
f 20 glycoproteins. During transit through the Golgi apparatus, the Asn-linked carbohydrate 
moiety is modified by a series of specific glycosidases (reviewed in Kornfeld, R. and 
& Kornfeld, S., (1985) Annu. Rev. Biochem 54:631-664; Tarentino, A.L. et al., (1989) Methods 
^' Cell Biol. 32:111-139). These modifications convert the structure from the immature "high 

mannose" to the mature "complex" type. At one step in this process, a Golgi enryme, a- 
25 mannosidase II, removes two mannose residues from the complex rendering it insensitive to 
Endo H (Kornfeld, R. and Kornfeld, S., (1985) Annu. Rev. Biochem 54:63 1 -664). 

The biochemical behavior of mouse Shh appears to be quite similar to that described 
for the Drosophila Hedgehog (Dros-HH) protein (Lee, J.L. et al., (1992) Cell 71:33-50; 
Tabata, T. et al., (1992) Genes & Dev. 6:2635-2645). In vitro translation of Drosophila hh 

30 mRNA, in the presence of microsomes, revealed products with molecular masses 
corresponding to full length protein, as well as to the product expected after cleavage of the 
predicted internal (Type II) signal peptide (Lee, J.L. et al., (1992) Cell 71:33-50). 
Interestingly, no additional, processed forms were observed. However, such forms could 
have been obscured by breakdown products migrating between 20 and 30 kDa. When an 

35 RNA encoding a form of the protein lacking the carboxy-terminal 61 amino acids was 
translated, no breakdown products were seen, but there is still no evidence of the proteolytic 
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processing observed with mouse Shh. A similar phenomenon has been observed in these 
experiments. A reduction in the extent of proteolytic processing is seen when a mouse Shh 
protein lacking 10 carboxy-terminal amino acids is translated in vitro or expressed in cos 
cells (data not shown). This suggests that sequences at the carboxy termini of Hh proteins act 
5 at a distance to influence the efficiency of processing. 

Recently, Lee et al. (Science 266:1528-1537, 1994) described the biochemical 
behavior of the Drosophila HH protein. Using region-specific antisera, they detected similar 
processed forms of HH in embryonic tissues, thus confirming studies in which processing of 
HH was observed in embryos forced to express high levels of HH from a heat shock 
10 promoter (Tabata and Komberg (1994) Cell 76:89-102). Thus, Drosophila HH is processed 
to yield a 19 kDa amino-terminal peptide and a 25 kDa carboxy-terminal peptide. 
Furthermore, Lee et al. concluded that the production of the processed forms occurs via an 
autocatalytic mechanism and identified a conserved histidine residue (at position 329, 

p. according to Lee et al. (Science 266:1528-1537, 1994)) which is required for self-cleavage of 

00 15 HH protein in vitro and in vivo. The significance of the proteolytic processing is 
demonstrated by the inability of self-processing-either because of mutation of this histidine 

*p residue or because of truncation of sequences at the extreme carboxy terminus-to carry out 

jjjj HH functions in Drosophila embryos. 

^ Their studies of the biochemical behavior of mouse and chicken Shh and mouse Jhh 

* 20 proteins correlate well with the Drosophila studies of Lee et al. (Science 266:1528-1537, 
Ejj 1994) in that the similar proteolytic processing of endogenous vertebrate proteins in 

Q embryonic tissues was demonstrated. Furthermore, it was demonstrated that the efficiency of 

^ processing depends on sequences located at the extreme carboxy terminus of mouse Shh. 

Interestingly, it has also been shown that he specificity of mouse Shh cleavage may depend 
25 on targeting of the protein to the secretory pathway, since a form lacing a signal peptide is 
processed into an approximately 28-kDa amino-terminal form. A similar protein is observed 
as the predominant species when it was attempted to express full-length mouse Shh in 
- bacteria (data no shown). Lee et al. (Science 266:1528-1537, 1994) have demonstrated that 
two zebra fish hedgehog proteins undergo proteolytic processing when translated in vitro, 
30 even in the absence of microsomal membranes. The electrophoretic mobilities of the 
processed peptides are consistent with cleavage occurring at a position similar to that of the 
Drosophila HH cleavage site. Furthermore, they showed that the cleavage fails to occur if 
the conserved histidine residue is mutated, arguing for an autoproteolytic mechanism similar 
to that of the Drosophila protein. However, the processing of mouse or chicken Shh protein 
35 translated in vitro was not detected unless microsomal membrane are included. Therefore, it 
is possible that correct proteolytic processing of vertebrate hedgehog proteins is dependent on 
specific incubation conditions or may require cellular factors in addition to Shh itself. 
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An additional correlation between the work presented here and that of Lee et aL 
(Science 266:1528-1537, 1994) concerns the different behaviors of the amino (smaller) and 
carboxy (larger) terminally processed forms of the hedgehog proteins. The evidence is 
presented that the 27kDa carboxy-terminal form diffuses more readily from expressing cells 
5 than the 19kDa ammo-terminal form, which seems to be retained near the cell surface. The 
polyanions heparin and suramin appear capable of releasing the amino peptide into the 
medium. Similarly, the amino-terminal form of Drosophila HH is more closely associated 
with the RNA expression domain in embryonic segments than is the carboxy-terminal form, 
and the amino-terminal form binds to heparin agarose beads. Therefore, the distinct 
1 0 behaviors of the different hedgehog peptides have been conserved across phyla. 

The observed molecular masses of the amino terminally processed forms of mouse 
and chicken Shh> mouse Ihh proteins, and Drosophila HH are between 19 and 20 kDa. 
Therefore, the predicted secondary proteolytic cleavage site would be located near the border 
q of the sequences encoded by the second and third exons. Interestingly, the region marks the 

{S 15 end of the most highly related part of the hedgehog proteins. The amino terminal (smaller) 
form would contain the most highly conserved portion of the protein. In fact, the amino acids 
4= encoded by exons one and two (exclusive of sequences upstream of the putative signal 

)£j peptide cleavage sites) share 69% identity between Drosophila Hh and mouse Shh 9 and 99% 

|p identity between chick and mouse Shh. Amino acid identity in the region encoded by the 

* 20 third exon is much lower 30% mouse to Drosophila and 71% mouse to chick (Echelard, Y. et 
§ aL, (1993) Cell 75:1417-1430). 

n 

Sj However, the boundary between sequences encoded by exons 2 and 3 is unlikely to be 



ry 
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the actual proteolytic processing site, because a Drosophila HH protein containing a large 
deletion which extends three amino acids beyond this boundary is still cleaved at the 

25 expected position in vitro (Lee et aL (1994) Science 266: 1 528-1 537). Moreover, the analysis 
of an amino-terminal mouse Shh peptide truncated at amino acid 193 (the fourth amino acid 
encoded by exon 3, described below) suggests that normal cleavage must occur downstream 
of this position. Close examination of hedgehog protein sequences reveals that strong 
sequence conservation between the Drosophila and vertebrate proteins continues for only a 

30 short distance into the third exon. If it is assumed that cleavage will generate an amino 
terminal product of no greater than 20 kDa, given the resolution of analysis, all of the data 
would indicate that cleavage occurs at 1 of the 10 amino acids within the mouse Shh 
positions 194-203, according to Echelard et al. (Cell 75:1417-1430, 1993). 



35 



(vi) Hedgehog Signalling 
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In order to satisfy the criteria for intercellular signaling, hedgehog proteins must be 
detected outside of their domains of expression. This has been clearly demonstrated for 
Drosophila HH. Using an antiserum raised against nearly full length Dros-HH protein, 
Tabata and Kornberg (Tabata, T. and Romberg, T.B., (1992) Cell 76:89-102) detect the 
5 protein in stripes that are slightly wider than the RNA expression domains in embryonic 
segments, and just anterior to the border of the RNA expression domain in wing imaginal 
discs. Similarly, Taylor, et. aL, (1993) Meek Dev. 42:89-96, detected HH protein in discrete 
patches within cells adjacent to those expressing hh RNA in embryonic segments using an 
antiserum directed against an amino-terminal portion of Hh which, based on the proteolytic 
10 processing data (Tabata, T. et aL, (1992) Genes & Dev. 6:2635-2645), is not likely to 
recognize the carboxyl cleavage product. 

The detection of Hh beyond cells expressing the hh gene is consistent with the 
phenotype of hh mutants. In these animals, cellular patterning in each embryonic 
p parasegment in disrupted resulting in an abnormal cuticular pattern reminiscent of that seen 

W 1 5 in wg mutants. Further analysis has revealed that the loss of hh gene function leads to loss of 
^ wg expression in a thin stripe of cells just anterior to the hh expression domain (Ingham, P.W. 

*p and Hidalgo, A., (1993) Development 1 17:283-291). This suggests that Hh acts to maintain 

Q wg expression in neighboring cells. The observation that ubiquitously expressed Hh leads to 

ji ectopic activation of wg supports this model (Tabata, T. and Kornberg, T.B., (1992) Cell 

* 20 76:89-102). In addition to these genetic studies, there is also indirect evidence that Hh acts at 
tj a distance from its site of expression to influence patterning of the epidermis (Heemskerk, J. 

P and DiNardo, S., (1 994) Cell 76:449-460). 

PJ 

y3 The apparent effect of Drosophila Hh on neighboring cells, as well as on those 

^ located at a distance from the site of hh expression is reminiscent of the influence of the 

25 notochord and floor plate on the developing vertebrate CNS, and of the ZPA in the limb. The 
notochord (a site of high level Shh expression) induces the formation of the floor plate in a 
contact dependent manner, while the notochord and floor plate (another area of strong Shh 
expression) are both capabie of inducing motorneurons at a distance (Placzek, M. et aL, 
(1993) Development 1 17:205-218; Yamada, T. et aL, (1993) Cell 73:673-686). 

30 Moreover ZPA activity is required not only for patterning cells in the extreme 

posterior of the limb bud where Shh is transcribed, but also a few hundred microns anterior of 
this zone. Several lines of evidence indicate that Shh is able to induce floor plate (Echelard, 
Y. et aL, (1993) Cell 75:1417-1430; Roelink, H. et aL, (1994) Cell 76:761-775) and mediate 
the signaling activity of the ZPA (Riddle, RJD. et aL, (1993) Cell 75:1401-1416). Since it 

35 has been shown that Shh is cleaved, it can be speculated that the processed peptides may have 
distinct activities. The smaller amino terminal form, which appears to be more poorly 
secreted, less stable or retained at the cell surface or in the extracellular matrix, may act 
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locally. In contrast, the larger carboxy terminal peptide could possibly function at a distance. 
In this way, Shh peptides may mediate distinct signaling functions in the vertebrate embryo. 
Alternatively, the carboxy-terminal peptide may be necessary only for proteolytic processing, 
with all signaling activity residing in the amino-terminal peptide. 



Example 7 

Sonic hedgehog andFgf-4 act through a signaling cascade and feedback loop to integrate 
growth and patterning of the developing limb bud 

(i) Experimental Procedures 
Cloning of Chicken Fgf-4 and Bmp-2 

A 246 bp fragment of the chicken Fgf-4 gene was cloned by PCR from a stage 22 
chicken limb bud library. Degenerate primers were designed against previously cloned Fgf-4 
and Fgf-6 genes: fgf5' (sense) AAA AGC TTT AYT GYT AYG TIG GIA THG G (SEQ ID 
No:38) and fg£3' (antisense) AAG AAT TCT AIG CRT TRT ART TRT TIG G (SEQ ID 
No:39). Denaturation was at 94°C for 2 min, followed by 30 cycles of 94°C for 30 sec, 50°C 
for 60 sec, and 72°C for 30 sec, with a final extension at 72°C for 5 min. The PCR product 
was subcloned into the Bluescript SK+ vector. A clone was sequenced and confirmed as Fgf- 
4 by comparison with previously published Fgf-4 genes and a chicken Fgf-4 gene sequence 
kindly provided by Lee Niswander. 

BMP-related sequences were amplified from a stage 22 posterior limb bud cDNA 
library prepared in Bluescript using primers and conditions as described by Basler, et al. 
(1993). Amplified DNAs were cloned and used to screen a stage 22 limb bud library 
prepared in Jl-Zap (Stratagene). Among the cDNAs isolated was chicken Bmp-2. Its identity 
was confirmed by sequence comparison to the published clones (Francis, et al., (1994) 
Development 120:209-218) and by its expression patterns in chick embryos. 

Chick Surgeries and Recombinant Retroviruses 

All experimental manipulations were performed on White Leghorn chick embryos (S- 
SPF) provided by SPAFAS (Norwich, Conn). Eggs were staged according to Hamburger and 
Hamilton (1951) J. Exp. Morph. 88:49-92. 

Viral supernatants of Sonic/RCAS-A2 or a variant containing an influenza 
hemaglutinin epitope tag at the carboxyl terminus of the hedgehog protein (Sonic! . 1/RCAS- 
A2, functionally indistinguishable from Sonic/RCAS-A2% were prepared as described 
(Hughes, et al., (1987) J. Virol 61:3004-13; Fekete and Cepko, (1993) Mol & Cell Biol 
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13:2604-13; Riddle, et aL, (1993) Cell 75:1401-16). For focal injections the right wings of 
stage 18-21 embryos were transiently stained with nile blue sulfate (0.01 mg/ml in Ringer's 
solution) to reveal the AER. A trace amount of concentrated viral supernatant was injected 
beneath the AER. 

The AER was removed using electrolytically sharpened tungsten wire needles. Some 
embryos had a heparin-acrylic bead soaked in FGF-4 solution (0.8 mg/ml; a gift from 
Genetics Institute) or PBS stapled to the limb bud with a piece of 0.025mm platinum wire 
(Goodfellow, Cambridge UK) essentially as described by Niswander et al, (1993) Cell 
75:579-87. 

Limbs which were infected with SonicfRCAS virus after AER removal were infected 
over a large portion of the denuded mesoderm to ensure substantial infection. Those embryos 
which received both an Fgf-4 soaked bead and virus were infected only underneath the bead. 

In Situ Hybridizations and Photography 

Single color whole mount in situ hybridizations were performed as described (Riddle, 
et aL, (1993) Cell 75:1401-16). Two color whole mount in situ hybridizations were 
performed essentially as described by Jowett and Lettice (1994) Trends Genet, 10:73-74. 
The second color detection was developed using 0.125mg/ml magenta-phos (Biosynth) as the 
substrate. Radioactive in situ hybridizations on 5jj.ni sections was performed essentially as 
described by Tessarollo, et al. (1992) Development 1 15:1 1-20. 

The following probes were used for whole mount and section in situ hybridizations: 
Sonic: l.Tkb fragment of pHH2 (Riddle, et al., (1993) Cell 75:1401-16). Bmp-2: 1.5 kb 
fragment encoding the entire open reading frame. Fgf-4: 250 bp fragment described above. 
Hox d-11: a 600 bp fragment, Hoxd-13: 400 bp fragment both including 5' untranslated 
sequences and coding sequences upstream of the homeobox. RCAS: 900 bp Sall-Clal 
fragment of RCAS (Hughes et al., (1987) J Virol 61 :3004-12). 

(ii) Relationship of Sonic to Endogenous Bmp-2 and Hoxd Gene Expression 

The best candidates for genes regulated by Sonic in vivo are the distal members of the 
Hoxd gene cluster, Hoxd-9 through -13, and Bmp-2. Therefore, the relationships of the 
expression domains of these genes in a staged series of normal chick embryos were analyzed. 
Hoxd-9 and Hoxd- 10 are expressed throughout the presumptive wing field at stage 16 
(Hamburger and Hamilton, (1951) J. Exp. Morph. 88:49-92), prior to the first detectable 
expression of Sonic at early stage 18. Hoxd-11 expression is first detectable at early stage 18, 
the same time as Sonic 9 in a domain coextensive with Sonic. Expression of Hoxd-12 and 
Hoxd-13 commence shortly thereafter. These results suggest that Sonic might normally 
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induce, directly or indirectly, the expression of only the latter three members of the cluster, 
even though all five are nested within the early limb bud. 

As limb outgrowth proceeds Sonic expression remains at the posterior margin of the 
bud. In contrast the Hoxd gene expression domains, which are initially nested posteriorly 
5 around the Sonic domain, are very dynamic and lose their concentric character. By stage 23 
the Hoxd- J 1 domain extends anteriorly and distally far beyond that of Sonic, while Hoxd- 13 
expression becomes biased distally and displaced from Sonic. 

While it is not clear whether Bmp-2 is expressed before Sonic (see Francis et. al., 
(1994) Development 120:209-218) Bmp-2 is expressed in a mesodermal domain which 
10 apparently overlaps and surrounds that of Sonic at the earliest stages of Sonic expression. As 
the limb bud develops, the mesodermal expression of Bmp-2 remains near the posterior limb 
margin, centered around that of Sonic, but in a larger domain than Sonic. This 
correspondence between Sonic and Bmp-2 expression lasts until around stage 25, much 
longer than the correspondence between Sonic and Hoxd gene expression. After stage 25 

9 1 5 Bmp-2 expression shifts distally and is no longer centered on Sonic. 

10 

(Hi) Relationship of Sonic to Induced Bmp-2 and Hoxd Gene Expression 

in 

P The feet that the expression domains of the Hoxd genes diverge over time from that of 

20 Sonic hedgehog implies that Sonic does not directly regulate their later patterns of expression, 
p This does not preclude the possibility that the later expression domains are genetically 

^ downstream of Sonic. If this were the case, exogenously expressed Sonic would be expected 

pj to initiate a program of Hoxd gene expression which recapitulates that seen endogenously. 

\D Therefore, the spatial distribution of Hoxd gene expression at various times following Sonic 

^ 25 misexpression was compared. The anterior marginal mesoderm of early bud (Stage 18-20) 
wings was injected at a single point under the AER with a replication competent virus that 
expresses a chicken Sonic cDNA. Ectopic Sonic expressed by this protocol leads to both 
anterior mesodermal outgrowth and anterior extension of the AFR. 

The Sonic and Hoxd gene expression domains in the infected limbs were analyzed in 
30 sectioned and intact embryos. Viral Sonic message is first detected approximately 18 hours 
after infection at the anterior margin, at the same time as, and approximately coextensively 
with, induced Hoxd-11. This suggests that Sonic can rapidly induce Hoxd- 11 expression and 
that the lag after injection represents the time required to achieve Sonic expression. By 35 
hours post infection distal outgrowth of infected cells combined with lateral viral spread 
35 within the proliferating cells leads to viral expression in a wedge which is broadest at the 
distal margin and tapers proximally. By this time, Hoxd-11 expression has expanded both 
antero-proximally and distally with respect to the wedge of Some-expressing cells, into a 
domain which appears to mirror the more distal aspects of the endogenous Hoxd-11 domain. 
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Weak Hoxd-13 expression is also detected at 35 hours in a subset of the Sonic expressing 
domain at its distal margin. 51 hours after infection the relationship of Sonic and Hoxd-11 
expression is similar to that seen at 35 hours, while the induced Hoxd-13 expression has 
reached wild type levels restricted to the distal portions of the ectopic growth. Thus the 
5 ectopic Hoxd expression domains better reflect the endogenous patterns of expression than 
they do the region expressing Sonic. This suggests that there are multiple factors regulating 
Hoxd expression but their actions lie downstream of Sonic. 

Since the endogenous Bmp-2 expression domain correlates well with that of Sonic, 
and Bmp-2 is induced by ZPA grafts, it was looked to see if Bmp-2 is also induced by Sonic. 
10 Bmp-2 is normally expressed in two places in the early limb bud, in the posterior mesoderm 
and throughout the AER (Francis, et aL, (1994) Development 120:209-218). In injected limb 
buds additional Bmp-2 expression is seen in both the anterior mesoderm and in the anteriorly 
extended AER. The domain of Bmp-2 expression is slightly more restricted than that of viral 
expression, suggesting a delay in Bmp-2 induction. Bmp-2 expression in both the mesoderm 
O. 15 and ectoderm is thus a downstream target of Sonic activity in the mesoderm. In contrast to 
p the expression domains of the Hoxd genes, the endogenous and ectopic Bmp-2 expression 

M domains correlate well with that of Sonic. This suggests that Bmp-2 expression is regulated 

JJJ more directly by Sonic than is expression of the Hoxd genes. 

^ 20 (iv) The AER and Competence to Respond to Sonic 

p 

Nl Ectopic activation of Hoxd gene expression is biased distally in virally infected 

^ regions, suggesting that ectodermal factors, possibly from the AER, are required for Hoxd 

ti3 gene induction by Sonic. To test this, Sonic virus was injected into the proximal, medial 

^ 25 mesoderm of stage 21 limb buds, presumably beyond the influence of the AER. Although 
the level of Sonic expression was comparable to that observed in distal injections, proximal 
misexpression of Sonic did not result in ectopic induction of the Hoxd genes or Bmp-2, nor 
did it result in any obvious morphological effect (data not shown). The lack of gene 
induction following proximal misexpression of Sonic suggests that exposure to Sonic alone is 
30 insufficient to induce expression of these genes. 

This was tested more rigorously by injection of Sonic virus into the anterior marginal 
mesoderm of stage 20/21 limb buds after the anterior half of the AER had been surgically 
removed. Embryos were allowed to develop for a further 36 to 48 hours before harvesting. 
During this time the AER remaining on the posterior half of the limb bud promotes almost 
35 wild type outgrowth and patterning of the bud. Gene expression was monitored both in 
sectioned and intact embryos. In the presence of the AFR, Sonic induces both anterior 
mesodermal proliferation and expression of Hoxd-11, Hoxd-13 and Bmp-2. In the absence of 
the overlying AER, Sonic does not induce either mesodermal proliferation or expression of 
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these genes above background. Signals from the AER are thus required to allow both the 
proliferative and patterning effects of Sonic on the mesoderm. 

Since application of FGF protein can rescue other functions of the AER such as 
promoting PD outgrowth and patterning, it was sought to determine whether FGFs might also 

5 promote mesodermal competence to respond to Sonic, FGF-4-soaked beads were stapled to 
AER-denuded anterior mesoderm which was infected with Sonic virus. Gene expression and 
mesodermal outgrowth were monitored as described previously. In the presence of both 
Sonic virus and FGF-4 protein, Hoxd-11, Hoxd-13 and Bmp-2 expression are all induced. 
The expression levels of the induced genes are similar to or greater than the endogenous 

10 expression levels, and are equivalent in magnitude to their induction in the presence of the 
AER. Thus Fgf-4 can induce the competence of the mesoderm to respond to Sonic. 

Sonic alone is insufficient to induce either gene expression or mesodermal 
proliferation in the absence of the AER, while the combination of Sonic and FGF-4 induces 
both proliferation and gene expression. It was than asked whether FGF-4 alone has any 

15 effect on gene induction or mesodermal proliferation. Application of FGF-4 in the absence 
of Sonic virus does not induce Hoxd or Bmp-2 gene expression above control levels, however 
FGF-4 alone induces mesodermal outgrowth. These results suggest that mesodermal gene 
activation requires direct action of Sonic on the mesoderm and that proliferative response to 
Sonic is indirect, due to the induction of FGFs. 

20 (v) Sonic Induces Polarized Fgf-4 Expression in the AER 

Fgf-4 is expressed in a graded fashion in the AER of the mouse limb bud, with 
maximal expression at the posterior region of the AER tapering to undetectable levels in the 
anterior ridge (Niswander and Martin, (1992) Development 114:755-68). Therefore, it was 

25 appropriate to investigate whether Fgf-4 is asymmetrically expressed in the chick AER, and 
whether its expression is induced by Sonic. A fragment of the chicken Fgf-4 gene was cloned 
from a stage 22 chicken limb library by PCR using degenerate primers designed from mouse 
Fgf-4 and Xenopus e-Fgf sequence; based on information provided by L. Niswander and G. 
Martin. Assignment of gene identity was based on primary sequence as well as comparison 

30 of expression patterns with that of murine Fgf-4 (Niswander and Martin, (1992) Development 
1 14:755-68). Whole mount in situ hybridization analysis showed strong limb expression of 
chick Fgf-4 in the AER. Fgf-4, like Bmp-2, is expressed all the way to the posterior border of 
the AER, but its anterior domain ends before the morphological end of the AER creating a 
posterior bias that has also been observed by Niswander et al., (1994) Nature (in press). 

35 Expression is first detected in the distal AER at about stage 18. As outgrowth proceeds the 
posterior bias develops. Expression peaks around stage 24/25 and then fades by stage 28/29. 

The expression domain of Fgf-4 becomes posteriorly biased as Sonic is expressed in 
the posterior mesoderm. This observation is consistent with Sonic influencing the expression 
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of Fgf-4 in the posterior AER. To test the effect of Sonic on Fgf-4 expression in the AER, 
stage 18-20 embryos were infected with Sonic virus in a single point at their anterior margin 
beyond the anterior limit of the AER. The embryos were harvested one to two days later, 
when an extension of the anterior AER became apparent. The expression of Fgf-4 was 
analyzed by in situ hybridization . Fgf-4 expression is induced in the anteriormost segment 
of the AER, in a region which is discontinuous with the endogenous expression domain, and 
overlies the domain of viral Sonic infection. This result contrasts with the Bmp-2 expression 
induced in the extended AER, which is always continuous with the endogenous expression 
domain. The asymmetry of the induced Fgf-4 expression indicates that Sonic polarizes the 
extended AER, much as a ZPA graft does (Maccabe and Parker, (1979) J. Embryol Exp. 
Morph. 53:67-73). Since FGFs by themselves are mitogenic for limb mesoderm, these 
results are most consistent with Sonic inducing distal proliferation indirectly, through the 
induction of mitogens in the overlying AER. 

(vi) Reciprocal Regulation of Sonic by Fgf-4 

Sonic thus appears to be upstream of Fgf-4 expression in the AER. However, since 
the AER is required to maintain polarizing activity in the posterior mesoderm (Vogel and 
Tickle, (1993) Development 19:199-206; Niswander et al., (1993) Cell 75:579-87), Sonic 
may also be downstream of the AER. If Sonic is regulated by the AER and the AER by 
Sonic, this would imply that they are reinforcing one another through a positive feedback 
loop. 

To test whether the AER dependence of ZPA activity is controlled at the level of 
transcription of the Sonic gene, Sonic expression following removal of the AER from the 
posterior half of the limb bud was assayed. Sonic expression is reduced in an operated limb 
compared to the contralateral control limb within ten hours of AER removal, indicating that 
Sonic expression is indeed AER dependent The dependence of Sonic expression on signals 
from the AER suggests that one of the functions of the AER is to constrain Sonic expression 
to the more distal regions of the posterior mesoderm. 

In addition to their mitogenic and competence-inducing properties, FGFs can also 
substitute for the AER to maintain the ZPA. In order to test whether FGFs can support the 
expression of Sonic , beads soaked in FGF-4 protein were stapled to the posterior-distal tips of 
limb buds after posterior AER removal. Embryos were assayed for Sonic expression 
approximately 24 hours later, when Sonic expression is greatly reduced in operated limb buds 
which had not received an FGF-4 bead. Strong Sonic expression is detectable in the posterior 
mesoderm, slightly proximal to the bead implant, and reflecting the normal domain of Sonic 
expression seen in the contralateral limb. With the finding that FGF-4 can maintain Sonic 
expression, the elements required for a positive feedback loop between Sonic expression in 
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the posterior mesoderm and Fgf-4 expression in the posterior AER are established (see also 
Niswander et al. (1994) Nature (in press)). 

The induction of Bmp-2 expression by Sonic requires signals from the AER, and its 
domain correlates over time with that of Sonic. Therefore, it was interesting to learn if the 
continued expression of Bmp-2 also requires signals from the AER, and if so, whether they 
could be replaced by FGF-4. To test this, Bmp-2 expression following posterior AER 
removal, and following its substitution with an FGF-4 bead was assayed; Bmp-2 expression 
fades within hours of AER removal, and can be rescued by FGF-4. These data indicate that 
the maintenance of Bmp-2 expression in the posterior mesoderm, like that of Sonic, is 
dependent on signals from the AER, which are likely to be FGFs. 

(vii) The Mesodermal Response to Sonic 

It has been found that only mesoderm underlying the AER is responsive to Sonic, 
apparently because the AER is required to provide competence signals to the limb mesoderm. 
Fgf-4 9 which is expressed in the AER, can substitute for the AER in this regard, and thus 
might act in combination with Sonic to promote Hoxd and Bmp-2 gene expression in the 
mesoderm. FGFs may be permissive factors in a number of instructive pathways, as they are 
also required for activins to pattern Xenopus axial mesoderm (Cornell and Kimelman, (1994) 
Development 120:2187-2198; LaBonne and Whitman, (1994) Development 120:463-472). 

The induction of Hoxd and Bmp-2 expression in response to Sonic and FGF-4 in the 
absence of an AER suggests that the mesoderm is a direct target tissue of Sonic protein. 
Since Sonic can induce Fgf-4 expression in the AER, it follows that Sonic also acts indirectly 
on the mesoderm through the induction of competence factors in the AER. 

(viii) Downstream Targets and a Cascade of Signals Induced by Sonic 

The five AbdB-like Hoxd genes, Hoxd-9 through -13, are initially expressed in a 
nested pattern centered on the posterior of the limb bud, a pattern which suggests they might 
be controlled by a common mechanism (Dolle, et al., (1989) Cell 75:431-441; Izpisua- 
Belmonte, et al., (1991) Nature 350:585-9). The analysis of the endogenous and induced 
domains of Hoxd gene expression suggests that Sonic normally induces expression of Hoxd- 
11, -12 and -13. In contrast it was found that Hoxd-9 and -10 expression initiate before Sonic 
mRNA is detectable. This implies that at least two distinct mechanisms control the initiation 
of Hoxd gene expression in the wing bud, only one of which is dependent on Sonic. 

Several observations suggest that the elaboration of the Hoxd expression domains is 
not controlled directly by Sonic, but rather by signals which are downstream of Sonic. The 
Hoxd expression domains rapidly diverge from Sonic, and evolve into several distinct 
subdomains. Moreover these subdomains appear to be separately regulated, as analysis of the 
murine Hoxd-11 gene promoter suggests that it contains independent posterior and distal 
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elements (Gerard, et al., (1993) Embo. J. 12:3539-50). In addition, although initiation of 
Hoxd-U through -13 gene expression is dependent on the AER, their expression is 
maintained following AER removal (Izpisua-Belmonte, et al., (1992) Embo. J. 11:1451-7). 
As Sonic expression fades rapidly under similar conditions, this implies that maintenance of 
Hoxd gene expression is independent of Sonic. Since ectopic Sonic can induce a 
recapitulation of the Hoxd expression domains in the limb, it can be concluded that although 
indirect effectors appear to regulate the proper patterning of the Hoxd expression domains, 
they are downstream of Sonic. Potential mediators of these indirect effects include Bmp-2 in 
the mesoderm and Fgf-4 from the AER. 

In contrast to the Hoxd genes, Bmp-2 gene expression in the posterior limb mesoderm 
appears to be continually regulated by Sonic. It was found that both endogenous and ectopic 
Bmp-2 expression correspond to that of Sonic. Furthermore, continued Bmp-2 expression is 
dependent on the AER and can be rescued by FGF-4. It is likely that this is an indirect 
consequence of the feet that Sonic expression is also maintained by the AER and can be 
rescued by FGF-4. In fact, Bmp-2 expression might be a direct response of cells to secreted 
Sonic protein. The differences between Bmp-2 and Hoxd gene expression suggest that 
multiple pathways downstream of Sonic regulate gene expression in the mesoderm. 

Bmp-2 itself is a candidate for a secondary signaling molecule in the cascade of 
patterning events induced by Sonic. Bmp-2 is a secreted molecule of the TGF-p* family and 
its expression can be induced by Sonic. This appears to be an evolutionary conserved 
pathway, as HH, the Drosophila homolog of Sonic, activates the expression of dpp, the 
homolog of Bmp-2, in the eye and wing imaginal discs (Heberlein, et al., (1993) Cell 75:913- 
26; Ma, et al., (1993) Cell 75:927-38; Tabata and Kornberg, (1994) Cell 76:89-102). 
Expression of HH is normally confined to the posterior of the wing disc. Ectopic expression 
of HH in the anterior of the disc results in ectopic expression of dpp and ultimately in the 
duplication of wing structure with mirror image symmetry (Bassler and Struhl, (1994) Nature 
368:208-214). This effect is slrikingly parallel to the phenotypic results of ectopic expression 
of Sonic in the chick limb. 

(ix) Regulation of Sonic Expression 

Sonic expression is activated in the posterior of the limb bud very early during 
mesodermal outgrowth (Riddle et al., (1993) Cell 75:1401-16). The factors which initiate 
this localized expression are not yet identified but ectopic expression of Hoxb-8 at the 
anterior margin of the mouse limb bud results in the activation of a second domain of Sonic 
expression under the anterior AER (Charite el al., (1994) Cell 78:589-601). Since retinoic 
acid is known to be able to induce the expression of Hoxb-8 and other Hox genes in vitro 
(Mavilio et al., (1988) Differentiation 37:73-79) it is possible that endogenous retinoic acid 
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acts to make cells competent to express Sonic by inducing expression of upstream Hox genes, 
either in the very early limb bud or in the flank prior to the limb bud formation. 

Several lines of evidence suggest that once induced Sonic expression is dependent on 
signals from the posterior AER. Following its initiation in the posterior limb mesoderm, the 
5 Sonic expression domain moves distally as the limb bud grows out, always remaining 
subjacent to the AER. Similarly, Sonic expression can also be induced on the anterior margin 
of the limb bud by implantation of a retinoic acid bead, but the induced ectopic expression is 
limited to the mesoderm directly underlying the AER (Riddle, et aL, (1993) Cell 75:1401-16). 
In addition, ZPA activity fades rapidly following removal of the AER (Niswander, et al., 
10 (1993) Cell 75:579-87; Vogel and Tickle, (1 993) Development 1 19:199-206), and ZPA grafts 
only function when placed in close proximity to the AER (Tabin, (1991) Cell 66:199-217; 
Tickle, (1991) Development Supp. 1:113-21). The observation that continued Sonic 
expression depends on signals from the posterior AER reveals the mechanism underlying 
p these observations. 

15 The reliance of Sonic expression on AER-derived signals suggests an explanation for 

ff] the distal shift in Sonic expression during limb development (Riddle et al., (1993) Cell 

jj 75:1401-16). Signals from the AER also promote distal outgrowth of the mesodermal cells 

tH of the progress zone, which in turn results in the distal displacement of the AER Hence, as 

^ maintenance of Sonic expression requires signals from the AER, its expression domain will 

$ 20 be similarly displaced. 

P. It was found that replacement of the AER with FGF-4 soaked beads results in the 

q maintenance of Sonic expression. This result is consistent with the previous findings that 

W ZPA activity can be maintained in vivo and in vitro by members of the FGF family 

(Anderson, et al., (1993) Development 117:1421-33; Niswander et al., (1993) Cell 75:1401- 
25 16 ; Vogel and Tickle, (1993) Development 1 19:199-206). Since Fgf-4 is normally expressed 
in the posterior AER, these results suggest that Fgf-4 is the signal from the ectoderm 
involved in maintaining Sonic expression. 

(x) Sonic and Regulation and Maintenance of the AER 
30 Sonic can induce anterior extensions of the AER which have an inverted polarity 

relative to the endogenous AER. This polarity is demonstrated by examining the expression 
of two markers in the AER. In normal limbs Bmp~2 is expressed throughout the AER, while 
Fgf-4 is expressed in the posterior two thirds of the AER. In the extended AER resulting 
from ectopic Sonic expression, Bmp-2 is again found throughout the AER, while Fgf-4 
35 expression is biphasic, found at either end of the AER, overlying the anterior and posterior 
mesodermal domains expressing Sonic. These results are consistent with previous 
observations that antero-posterior polarity of the AER appears to be regulated by the 
underlying mesoderm, and that ZPA grafts lead to the induction of ectopic, polarized AER 
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tissue (Maccabe and Parker, (1979) J. Embryol Exp. Morph. 53:67-73). Our results also 
suggest that the normal AP polarity of the AER is a reflection of endogenous Sonic 
expression. The induced AER is sufficient to promote complete PD outgrowth of the induced 
structures (Riddle et al., (1993) Cell 75:1401-16). Hence whatever factors are necessary to 
5 maintain the AER are also downstream of Sonic. 



(xi) A Positive Feedback Loop Between Sonic and Fgf-4 



10 



The induction of Fgf-4 expression by Sonic in the ectopic AER, and the maintenance 
of Sonic expression by FGF-4 suggest that Sonic and Fgf-4 expression are normally sustained 
by a positive feedback loop. Such a feedback loop would allow the coordination of 
q mesodermal outgrowth and patterning. This coordination is possible because Sonic patterns 

13 15 mesodermal tissue and regulates Fgf-4 expression, while FGF-4 protein induces mesodermal 
fj proliferation and maintains Sonic expression. Moreover mesodermal tissue can only be 

*p patterned by Sonic in the context of a competence activity provided by F8f-4. Thus 

W patterning is always coincident with proliferation. 

C3 ^ 
yj It remains possible that exogenously applied Fgf-4 might be mimicking the activity of 

* 20 a different member of the FGF family. For example, Fgf-2 is expressed in the limb 

^ mesoderm and the AER (Savage et al., (1993) Development Dynamics 198:159-70) and has 

p similar effects on limb tissue as Fgf-4 (Niswander and Martin, (1993) Nature 361:68-71; 

JJj Niswander, et al., (1993) Cell 75:579-87; Riley, et al., (1993) Development 118:95-104; 

Fallon, etal., (1994) Science 264:104-7). 



25 



(xii) Coordinated Regulation of Limb Outgrowth and Patterning 



Patterning and outgrowth of the developing limb are known to be regulated by two 
major signaling centers, the ZPA and AER. The identification of Sonic and FGFs as 

30 molecular mediators of the activities of the ZPA and AER has allowed for dissociation of the 
activities of these signaling centers from their regulation, and investigation of the signaling 
pathways through which they function. 

The results presented above suggest that the ability of cells to respond to Sonic 
protein is dependent on FGFs produced by the AER. It was also found that Sonic induces a 

35 cascade of secondary signals involved in regulating mesodermal gene expression patterns. In 
addition evidence was found for a positive feedback loop initiated by Sonic, which maintains 
expression of Sonic in the posterior mesoderm and Fgf-4 in the AER. The feedback loop 
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described suggests a mechanism whereby outgrowth and patterning along the AP and PD 
axes of the limb can be coordinately regulated. 

The results described above further suggest that Sonic acts as a short range signal 
which triggers a cascade of secondary signals whose interplay determines the resultant 
5 pattern of structures. The data suggest a number of inductive pathways that can be combined 
: ^ to generate a model (Figure 14) which describes how Sonic, in coordination with the AER, 
acts to pattern mesodermal tissues along the anterior-posterior limb axis, while 
simultaneously regulating proximal-distal outgrowth. 

Following its induction, Sonic signals to both the limb ectoderm and mesoderm. 
10 Sonic imposes a distinct polarity on the forming AER, including the posteriorly biased 
expression of Fgf-4, and the AER becomes dependent on continued Sonic expression. The 
mesoderm, as long as it is receiving permissive signals from the overlying ectoderm, 
responds to the Sonic signal by expressing secondary signaling molecules such as Bmp-2 and 
p by activating Hoxd genes. Bmp-2 expression is directly dependent on continued Sonic 

W 15 expression, while the continued expression of the Hoxd genes, rapidly becomes Sonic. 
jg independent. In a reciprocal fashion, maintenance of Sonic hedgehog expression in the 
*p posterior mesoderm becomes dependent on signals from the AER. Since the factors 

Vj expressed by the AER are not only required for the maintenance of Sonic expression and 

activity, but are also mitogenic, growth and patterning become inextricably linked. 
20 Coordination of limb development through interdependent signaling centers forces the AP 
and PD structures to be induced and patterned in tandem. The pathways elucidated herein 
thus provide a molecular framework for the controls governing limb patterning 
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Example 8 

25 Sonic f BMP-4, and Hox Gene Expression Suggest a Conserved Pathway in Patterning the 

Vertebrate and Drosophila Gut 



(i) Experimental Procedure 
30 In Situ Hybridization and Photography 

BMP probes were isolated using primers designed to amplify members of the TGF- 
and BMP families (Basler, K. et al., (1993) Cell 73:687-702, eight independent 120 bp BMP 
fragments were amplified from a stage 22 chicken posterior limb bud plasmid cDNA library. 
35 These fragments were pooled and used to screen an unamplified stage 22 limb bud lambda 
zap cDNA library constructed as in Riddle et al., (1993) Cell 75:1401-16 . Among the BMP 
related clones isolated were an approximately 1.9 kb cDNA clone corresponding to chicken 
BMP-2 and an approximately 1.5 kb cDNA clone corresponding to chicken BMP-4. Both 
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clones contain the* entire coding regions. The Sonic clone was obtained as described in 
Riddle et al, (1993) Cell 75:1401-16. Digoxigenin-UTP labeled RNA probes were 
transcribed as per Riddle et al., (1993) Cell 75:1401-16. Briefly, harvested chick embryos 
were fixed overnight in 4% paraformaldehyde, washed in PBS then processed for whole 
mount in situ hybridization methods are per Riddle et al., (1993)Ce// 75:1401-16. Embryos 
were photographed from either ventral or dorsal surfaces under transmitted light using a 
Nikon zoom stereo microscope with Kodak Ektar 100 ASA film. Whole mount in situ 
hybridization embryos and viscera were processed for sectioning as described in Riddle et al., 
(1993)Ce// 75:1401-16. 15-25 |im transverse sections were air dried and photographed with 
brightfield or numarski optics using a Zeiss Axiophot microscope and Kodak Ektar 25 ASA 
film. 

Chick Embryos and Recombinant Retroviruses 

A retroviral vector engineered to express a full length cDNA of chicken Sonic, as in 
Riddle et al. (1993) Cell 75:1401-16, was injected unilaterally into stage 8-13 chicken 
embryos targeting the definitive endoderm at the mid-embryo level. At this stage the CIP has 
not formed and neither Sonic nor BMP-4 are expressed in the region injected. Injections were 
performed on the ventral surface on embryos cultured with their ventral surface facing up 
(New, D.A.T. (1955) Embryol Exp. Morph. 3:320-31. Embryos were harvested 18-28 hours 
after injection and prepared for whole mount in situ hybridization (see above description of in 
situ experiment), hybridized with Sonic or BMP-4 digoxigenin labeled probes. 

In Situ Hybridization with Hox Genes 

Cloned cDNA of the chicken homologues of ifoxa-9,-10,-1 1,-13; 6-9, c-9,-10,-1 1; d- 
9,-10,-ll,-12,and -13 were used to transcribe digoxigenen-UTP labeled riboprobes for whole 
mount in situ hybridization. Domestic chick embryos were harvested into PBS and 
eviscerated. The visceral organ block was fixed in 4% paraformaldehyde overnight and 
processed for whole mount in situ hybridization. Methods and photographic technique as 
described above. 

(ii) Expression of Sonic and BMP-4 in Stage 13 Chick Embryos Determined by Whole Mount 
In Situ Hybridization 

Chick gut morphogenesis begins at stage 8 (Hamberger and Hamilton, (1987) Nutr. 
6:14-23 with a ventral in-folding of the anterior definitive endoderm to form the anterior 
intestinal portal (AIP) (Romanoff, A.L., (1960) The Avian Embryo, The Macmillan Co., NY. 
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This lengthens posteriorly forming the foregut. A second wave of endodermal invagination is 
initiated posteriorly at stage 13, creating the caudal intestinal portal (CIP). The CP extends 
anteriorly forming the hindgut. Sonic expression, previously noted in the endoderm of the 
vertebrate gut (Riddle et al., (1993) Cell 75:1401-16; Echelard et al., (1993) Cell 75:1417- 
1430), is expressed early in a restricted pattern in the endodermal lips of the AIP and CIP. 
Somc'expression is detected in the endoderm of the AIP and CIP in pre gut closure stages. At 
later stages, stage 28 embryos, Sonic is expressed in the gut in all levels (fore-, mid-, and 
hind-gut) restricted to the endoderm. Sonic is known to be an important inductive signal in 
other regions of the embryo including the limb bud (Riddle et al., (1993) Cell 75:1401-16) 
and neural tube (Echelard et al., (1993) Cell 75:1417-1430; Kraus et al., (1994) Cell 75:1437- 
1444; Roelink et al., (1994) Cell 76:761-775). Since primitive gut endoderm is known to 
cause' gut-specific mesodermal differentiation when combined with non-gut mesenchyme 
(Haffen et al., (1987) Nutr. 6:14-23), we speculated that Sonic might function as an inductive 
signal to the visceral mesoderm. A potential target gene for the action of Sonic was 
suggested by analogy to the Drosophila imaginal discs where HH, the homologue of 
vertebrate Sonic, activates the expression of the TGF-& related gene dpp in adjacent cells 
(Tabata abd Kornberg, (1994) Cell 76:89-102; Heberlein et al., (1993) Cell 75:913-926; Ma 
et al., (1993) Cell 75:913-926; Basler et al., (1993) Cell 73:687-702). There are two 
vertebrate homologues of dpp, BMP-2 and BMP-4. The earliest detectable expression of 
BMP-4 occurs simultaneously with the first observable expression of Sonic in the developing 
gut. BMP-4 is expressed in a domain abutting Sonic at the AIP and the CIP, but is restricted 
to the adjacent ventral mesoderm. BMP-4 gut expression persists into later stage embryos, 
stage 33 embryos, in the visceral mesoderm only. The tissue restricted expression of both 
genes is maintained in all stages studied. BMP-2 is not expressed in the gut at the AIP or 
CIP, but is expressed in clusters of cells in the gut mesoderm in later stages, a pattern distinct 
from that of BMP-4. 

(Hi) Ectopic Expression of Sonic Induces Ectopic Expression of BMP-4 in Mesodermal 
Tissues of the Developing Chick 

To test whether Sonic is capable of inducing BMP-4 in the mesoderm we an ectopic 
expression system previously used to study the role of- Sonic in limb development was 
utilized (Riddle et al., (1993) Cell 75:1401-16). A replication competent retrovirus 
engineered to express Sonic 'was injected unilaterally into the presumptive endoderm and 
visceral mesoderm at mid-embryo positions in stage 8-13 chick embryos in vitro (New, 
D.A.T. (1955) Embryol. Exp. Morph. 3:320-321). When embryos were examined by in situ 
hybridization 18-26 hours later, the normal wild type expression of Sonic is detected at the 
AIP, CIP, and in the midline (neural tube and notochord). Ectopic Sonic expression is 
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present unilaterally on the left ventral surfece. Also, wild type Sonic expression is seen in the 
floor plate of the neural tube and notochord. Ectopic expression is seen unilaterally in the 
visceral endoderm, its underlying splanchnic mesoderm, and somatic mesoderm. BMP-4 
expression can be seen induced in the mesoderm at the site of injection, in addition to its 

5 normal expression in the mesoderm of the CIP. Wild type BMP-4 expression is seen in the 
most dorsal aspects of the neural tube and symmetrical lateral regions adjacent to the neural 
tube. Induced BMP-4 expression is present unilaterally in the splanchnic mesoderm at the 
site of Sonic viral injection, and not in the visceral endoderm. 

Since BMP-4 is, itself, a secreted protein, it could function as a secondary signal in an 

10 inductive cascade, similar to the signal cascades from HH to dpp in Drosophila imagmal 
discs (Tabata abd Romberg, (1994) Cell 76:89-102; Heberlein et al., (1993) Cell 75:913-926; 
Ma et al., (1993) Cell 75:913-926; Basler et al., (1993) Cell 73:687-702) and from Sonic to 
BMP-2 in the limb bud. In the gut, BMP-4 could act as a secondary signal either as part of a 
feedback loop to the endoderm or within the visceral mesoderm. This latter possibility is 

15 consistent with the finding that in mice homozygous for a deletion in the BMP-4 gene, the 
ventral mesoderm fails to close. 

(iv) Expression ofHox Genes in the Developing Chick Gut 



® 20 There is a striking parallel between the apparent role of Sonic as an endoderm-to- 

h mesoderm signal in early vertebrate gut morphogenesis and that of its Drosophila 

^ homologue, HH. HH ( like Sonic) is expressed in the Drosophila gut endoderm from the 

| earliest stages of morphogenesis (Taylor et al., (1993) Meek Dev. 42:89-96). Its putative 

*0 receptor, patched, is found in the visceral mesoderm implicating HH (like Sonic) in 

^ 25 endodermal-mesodermal inductive interactions. This led to consideration whether other 
genes known to be involved in regulating Drosophila gut development might also play a role 
in regulating chick gut morphogenesis. Regionally specific pattern in' Drosophila gut 
endoderm is regulated by a pathway involving restricted expression of homeotic genes in the 
mesoderm (McGinnis and Krumlauf, (1992) Cell 68:283-302). Although the basis for 
30 patterning the vertebrate gut is poorly understood, in several other regions of the embryo Hox 
genes have been implicated as key regulators of patterns. Vertebrate Hox genes are expressed 
in overlapping anteroposterior domains which correlate with structural boundaries in the 
developing hindbrain, vertebrae, and limbs (McGinnis and Krumlauf, (1992) Cell 68:283- 
302). Whole mount in situ hybridization was used to test whether these genes are also 
35 expressed in the developing vertebrate hindgut and whether their domains of expression 
correlate with morphologic borders of the chick gut. 

Lumenal gut differentiation creates three morphologically and physiologically distinct 
regions: fore-, mid-, and hind- gut The fore-gut and hind-gut are the derivatives of the 
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primitive gut tubes initiated at the AIP and CIP respectively. Ultimately these tubes meet and 
fuse at the yolk stalk around stage 24-28. The midgut is formed from both foregut and 
hindgut primordia, just anterior and posterior to the yolk stalk. 

The most posterior derivative of the hindgut is the cloaca, the common gut-urogenital 
5 opening. The rest of the hindgut develops into the large intestine. The niidgut/hindgut 
border is demarcated by a paired tubal structure, the ceca (analogous to the mammalian 
appendix), which forms as budding expansions at the midgut/hindgut border at stage 19-20. 
Anterior to the ceca, the midgut forms the small intestine. 

The expression pattern of the 5' members of the Hox gene clusters in the chick 
[0 hindgut by whole mount in situ hybridization was studied. Hox gene expression patterns in 
the gut are dynamic. They are initially expressed (by stage 10) in broad mesodermal domains 
extending anteriorly and laterally. Later they become restricted. By stage 25, the Abd-B like 
genes of the Hoxa and Hoxd cluster are regionally restricted in their expression in hindgut 
mesoderm. The most anteriorly expressed gene, Hoxa-% has an anterior border of expression 
15 within the mesoderm of the distal midgut (to a point approximating the distal third of the 
midgut length). Each successive gene within the A and D Hox clusters has a more posterior 
domain of expression. Hoxa-10, Hoxd-9 and Hoxd-10 are restricted in their expression to the 
ceca. Hoxa-U and Hoxd-U have an anterior limit of expression in the mid-ceca at the 
approximate midgut/hindgut boundary (Romanoff, A.L. (1960) The Avian Embryo, The 
20 Macmillan Co. NY). Hoxd-\2 has an anterior limit at the posterior border of the ceca and 
extends posteriorly throughout the hindgut to the cloaca. Hoxa-U and Hoxd-U are 
expressed in the most posteriorly restricted domain, in the ventral mesoderm surrounding the 
cloaca. Hoxa-U and Hoxd-U are the only Abd-B like genes which are also expressed within 
the gut endoderm, from the ceca to the cloaca. 
25 The only member of the B or C Hox clusters which we found to be expressed in the 

hindgut is Hoxc-9. The expression of Hoxc-9 overlaps with its paralogues Hoxa-9 and Hoxd- 
9 in the midgut mesoderm, but has a sharp posterior boundary, complementary to Hoxa-U 
and Hoxd- 1 1 in the mid-ceca. 

The restricted expression of the Abd-B like Hox genes appear to demarcate the 
30 successive regions of the gut which will form the cloaca, the large intestine, the ceca, the 
mid-ceca at the midgut/hindgut border, and the lower portion of the midgut (perhaps 
identifying that portion of the midgut derived from the posterior gut tube3). Moreover, these 
molecular events presage regional distinctions. Expression of all Hox genes could be 
detected by stage 14, well before the hindgut lumen is closed (by stage 28) and is maintained 
35 in subsequent stages studied. Cytodifferentiation of we hindgut mesoderm and epithelium 
begins later, at stages 29-31 (Romanoff, A.L. (1960) The Avian Embryo, The Macmillan Co. 
NY). 
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These results suggest that specific Hox genes might be responsible for regulating 
morphogenesis of the gut Consistent with this, there is an apparent homeotic alteration in 
the gut of a transgenic mouse in which the anterior limit of expression of /fcxc-8 is shifted 
rostrally: a portion of foregut epithelium mis-differentiates as midgut (Pollock and Bieberich, 
5 (1992) Cell 71:911-923). 



10 (v) Conservation in the Expression of Regulatory Genes Involved in the Formation of 
Vertebrate and Drosophila Gut 

There is an intriguing parallel between the expression patterns of Sonic, BMP -4, and 
the Hox genes in the vertebrate gut and those of their homologues during Drosophila gut 
P 15 morphogenesis (Figure 15). This conservation is of particular interest because in Drosophila 
jf» the role played by these genes has been clarified genetically. HH (like its vertebrate 

^ homologue, Sonic) is expressed at the earliest stages in the gut endoderm and may be a signal 

4* to visceral mesoderm (Taylor et aL, (1993) Meek Dev. 42:89-96). Nothing is known directly 

2=1 of the relationship between HH expression and activation of expression of other genes in the 

=£! 20 Drosophila gut. However, in Drosophila imaginal discs, HH is known to activate the 
|Lj expression of dpp in a signaling cascade (Kraus et aL, (1994) Cell 75:1437-1444; Heberlein 

%A et aL, (1993) Cell 75:913-926; Ma et aL, (1993) Cell 75:913-926; Basler et aL, (1993) Cell 

P 73:687-702). Later in gut development, the production of dpp in the mesoderm contributes to 

J the regulation of the expression of homeotic genes in both the mesoderm and the endoderm 

01 25 (Bienz, M. (1994) 77G 10:22-26). Drosophila homeotic genes are expressed in the gut 
visceral mesoderm and their expression is known to determine the morphologic borders of 
the midgut This involves proper induction of gene expression in the adjacent endoderm, one 
of the mediators of the interaction is dpp (Bienz, M. (1994) TIG 10:22-26). If HH is required 
for the ultimate activation of the homeotic genes in the Drosophila midgut, this would 
30 parallel the situation in the vertebrate limb bud where Sonic functions as an upstream 
activator of the Hox genes (Riddle et aL, (1993) Cell 75:1401-1416), perhaps in a signaling 
cascade involving BMP-2. 

The extraordinary conservation in the expression of regulatory genes in the vertebrate 
and Drosophila gut strongly suggests a conservation of patterning mechanisms. Pathways 
35 established by genetic studies in Drosophila provide direct insights into the molecular basis 
for the regionalization and morphogenesis of the vertebrate gut. 
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Bacterially Expressed Hedgehog Proteins Retain Motorneuron-inducing Activity 

Various fragments of the mouse Shh gene were cloned into the pETUD vector as 
fusion proteins with a poly(His) leader sequence to facilitate purification. Briefly, fusion 
genes encoding the mature M-Shh protein (corresponding to Cys-25 through Ser-437 of SEQ 
5 ID No. 1 1) or N-terminal containing fragments, and an N-terminal exogenous leader having 
the sequence M-G-S-S-H-H-H-H-H-H-L-V-P-R-G-S-H-M were cloned in pETUD and 
introduced into E. colu The poly(His)-S/z/i fusion proteins were purified using nickel chelate 
chromatography according to the vendor's instructions (Qiagen catalog 30210), and the 
poly(His) leader cleaved from the purified proteins by treatment with thrombin. 

10 Preparations of the purified Shh proteins were added to tissue explants (neural tube) 

obtained from chicken embryos and cultured in a defined media (e.g., no serum). M-Shh 
protein was added to final concentrations of between 0.5pM to 5nM, and differentiation of 
the embryonic explant tissue to motorneuron phenotype was detected by expression of Islet- 1 
Q antigen. The bacterially produced protein was demonstrated to be active in the explant 

0 15 cultures at concentrations as low as 5 to 50pM. An Shh polypeptide containing all 19kd of 
Sf the amino terminal fragment and approximately 9kd of the carboxyl terminal fragment (see 

^ Example 6) displayed both motor neuron inducing activity and weak floor plate inducing 

p activity, indicating that these activities likely reside with the N-terminal fragment. 

s 

o 

N 20 Example 10 

P 

W Induction of Dopaminergic Neuron Phenotype with Sonic Hedgehog 

^ Hamburger-Hamilton stage 8-10 chick embryos were dissected free of the vitelline 

membranes and the areas opaca and pellucida. The embryos were then incubated in 
Dulbecco's Modified Eagle's Medium containing 0.5% dispase (Boehringer), 10 fig/ml 

25 hyaluronidase (Sigma), and 0.04% DNAse I (Sigma). The neural plate was then separated 
from its underlying mesoderm and notochord. The presumptive midbrain was identified and 
located according to its fate map (Couly and Le Douarin, 1987, Developmental Biol 
120:198-214) and isolated. The ventral one-third of the mesencephalic neural plate, 
comprising the presumptive floor plate and adjacent prospective dopaminergic neurons was 

30 then removed and discarded. The dorsal one-third was likewise dissected and removed. The 
remaining intermediate region was then incubated in vitro on a 2% agarose (Sigma) 
containing substrate made with alpha medium (Gibco). Recombinant Shh hedgehog, both 
human and mouse (full length cDNA), was then introduced to the tissue in one of two ways: 
(1) Bound to nickel-agarose beads (Qiagen) via the 6-histidine tag engineered onto the amino 

35 terminus of the protein, or (2) was incorporated in a soluble form directly into the agarose 
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substrate. Dihydrofolate reductase was used as the control protein for these experiments. 
The tissue was then incubated at 37°C for periods ranging from 36-48 hours. For analysis, 
tissue was fixed at 4°C in 4% paraformaldehyde and stored in 50% MeOH until staining. 
Staining was done for both tyrosine hydroxylase (TH) (Boehringer), L-DOPA (Chemicon), 
and dopamine (DA) (Chemicon). 

The data indicate that both mouse and human recombinant Shh hedgehog were active 
in the above described experiments. Furthermore, results indicate that addition of Shh 
induces both islet- 1 (a motor neuron marker) and TH (a catecholaminergic neuron) as well as 
the accumulation of L-DOPA in the mesencephalon, which is indicative of a dopaminergic 
phenotype. 

Example 1 1 

Sonic Hedgehog Induces Bone Formation 

The ectopic bone formation assay was essentially done as described in Sampath and 
Reddi, 1983, PNAS USA 80:6591-6595. The mouse Shh protein was frozen and lyophilized, 
and the powder was enclosed in no. 5 gelatin capsule. Alternatively, 0.9-2.0 mg of collagen 
sponge (Collastat) was used as matrix. The Shh protein (12.5 \xg) was added directly to the 
washed sponge, the sponge lyophilized, and the sponge implanted. The capsules or collagen 
sponges were implanted subcutaneously in the abdominal thoracic area of 21- to 49-day 
female Long-Evans rats and routinely removed at 1 1 days. Samples were processed for 
histological analysis, with l-fim glycolmethacrylate sections stained with Von Kossa and 
acid fuschin or toluidine blue. Von Kossa staining shows mineral (hydroxyapatite) 
formation. The collagen sponge by itself was used as a control in these experiments. The 
results indicate that the addition of mouse Shh protein induced bone formation in these rats. 



Example 12 
Patched is a receptor for Sonic Hedgehog 

(i) Experimental Procedures 

In vitro transcription of chick patched 

Chick patched coding sequences were inserted into the vector pRD67 (kindly 
provided by J. Cunningham) which contains an SP6 phage promoter and both 5' and 3 r 
untranslated sequences derived from the Xenopus laevis p-Globin gene. This vector also 
contains a. flu epitope inserted at the 3' end. After restriction endonuclease digestion with 
Xbal to generate linear templates, RNA was transcribed in vitro using SP6 RNA polymerase 
in the presence of ImM cap structure analog (m 7 G(5 t )ppp(5 , )Gm (Ambion Kit for capped 
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mRNAs) Following digestion with RQ1 DNase I (Ambion) to remove the DNA template, 
transcripts were purified by phenohcholoroform extraction and ethanol precipitation. 

Xenopus Oocyte Injection 

5 Xenopus laevis oocytes were surgically isolated and enzymatically defolliculated 

using published techniques (see, for example, Coleman et al., eds., Transcription and 
Translation: A Practical Approach. IRL Press, pp. 271-302; and Williams et al. (1988) PNAS 
85:4939-4943). Defolliculated oocytes were incubated at 19°C in media. Defolliculated 
oocytes were injected with water (control oocyte) or with 25ng of in vitro transcribed, capped 
10 chick patched or TMM13 cRNAs (prepared as described above). Injected oocytes were 
incubated for 16 hours at 19°C in media- 
Following a recovery period, healthy injected oocytes and uninjected controls were 
cultured at 19°C for 48 hours in Buffer A (50% L-15 medium (Sigma Chemicals), lmM L- 
glutamine, 15 mM HEPES pH 7.5, 100|ig/mL gentamcin). 

15 

Binding assays 

Recombinant, bacterial human sonic Hedgehog (Shh) (prepared as described in 
Example 9) was radioiodinated using the commercially available IODO-BEADS Iodination 
Reagent (Pierce, Product # 28,665) according to the manufacturer's instructions. The reagent 
20 used in this system is N-chloro-benzene sulfonamide (sodium salt) immobilized on 
nonporous, polystyrene beads. (Markwell, M.A.K. (1982) Anal Biochem. 125, 427-43) 
Reactions were carried out for 1 minute at room temperature before purifying the labeled 
protein by the manufacturer's protocol. 

Binding experiments were performed in oocytes which had been microinjected with 
25 patched or TM13 cRNAs, or uninjected controls. Defolliculated oocytes were preincubated 
in 1% Bovine Serum Albumin (BSA) in a Buffer B (5mM Tris (pH7.5), 5mM HEPES, 
lOOmM NaCl, lmM CaCl 2 , lmM MgCl 2 and 2mM KC1) for 1 hour at 4°C or room 
temperature (RT). Oocytes were incubated in the presence 125 I-labeled Shh (3nM) diluted in 
5% BSA in Buffer B for 40 minutes allowing for equilibrium to be reached. Incubated 
30 oocytes were rinsed with 1% BSA in buffer to remove unbound label. The amount of labeled 
Shh (cpm) bound to each oocyte was determined using a standard scintillation counter. 

As demonstrated by Figure 17, Shh shows saturable, concentration-dependent binding 
to the ectopically expressed patched on the oocytes. The amount of 125 I-labeled Shh bound 
per oocyte (cpm/oocyte) is indicated with respect to the concentration of labeled Shh (nM). 
35 The solid line represents binding assays using oocytes which had been microinjected with in 
vitro transcribed chick patched cRNA. The open squares represent control uninjected 
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oocytes. Saturable-, concentration-dependent binding of Shh was detected only in patched- 
expressing oocytes. Each point represents the mean ± standard deviation (SD) from five 
oocytes for the indicated concentrations of labeled Shh. 

Figure 18 shows that Shh binds to the recombinant patched in a specific manner. The 
amount of bound 125 I-labeled Shh (cpm/oocyte) was determined in Xenopus laevis oocytes 
microinjected with in vitro transcribed patched cRNA (first bar in set), uninjected control 
(second bar) and in vitro transcribed TM13 cRNA (third bar). TM13 was used as a control 
since it encodes an amino acid transporter that contains 13 membrane spanning domains. A 
significant increase in the amount of bound label was detected only in patched-exprtssmg 
oocytes. The concentration of labeled Shh used was 1 .3nM. Each point represents the mean 
+ SD of bound label (cpm) from five oocytes per condition. Asterisks indicate a statistical 
significant increase in the amount of bound label. 

Figure 19 shows the timecourse of human Shh binding to chick patched. The solid 
line and open triangles represent 125 I-labeled Shh binding to Xenopus laevis oocytes which 
had been microinjected with in vitro transcribed patched cRNA and incubated for the 
indicated times. Open squares and stars represent labeled Shh bound to control uninjected 
oocytes. The conditions represented by the solid line and open squares were performed at 
room temperature (RT). The conditions represented by the open triangles and stars v/ere 
performed at 4°C. A comparison of the binding curves at these two temperatures suggests . 
that some internalization of the label occurs at RT compared to 4°C. Thus, the lower ' 
temperature of 4°C provides an optimal condition for these assays. Each point represents the 
mean + SD of bound label (cpm) from five oocytes per condition. 

Figure 20 shows the dissociation rate of human Shh binding to chick patched 
Xenopus laevis oocytes incubated in the presence of 125 I-labeled Sh were mixed with the 
indicated concentrations of unlabeled Shh for 40 minutes at room temperature. The solid line 
represents labeled Shh binding to Xenopus laevis oocytes microinjected with in vitro 
transcribed patched cRNA. Open squares represent the nonspecific binding of l25 l-Shh to 
control uninjected oocytes. Displacement of bound labeled Shh was detected by competition 
with unlabeled protein. Each condition represents the mean bound label (cpm) from five 
oocytes ± standard deviation. 

A comparison of the fly and mouse patched sequences suggests the presence of two 
potential glycosylated, hydrophilic loops in the extracellular domain of this protein. 
(Goodrich et al. (1996), Genes & Development 10: 301-312 ) The address the effect of these 
putative glycosylation sites on Shh binding to patched, oocytes microinjected with chick 
patched cRNA were incubated with 2 jig/ml tunicamycin (New England Biolabs, Inc.). The 
expected decrease in the molecular weight of patched was detected in the tunicamycin- 
treated samples compared to the untreated controls as detected by Western Blot using the flu 
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tag antibody. As shown in Figure 21, binding of Shh is sensitive to the glycosylation state of 
patched. This figure shows bound 125 I-labeled Shh (cpm/oocyte) inXenopus laevis oocytes 
microinjected with in vitro transcribed patched cRNA (first two bars in set). The last two 
bars represent uninjected control oocytes. The concentration of labeled Shh used was 3 nM. 
The conditions shown in the first and third bars represent oocytes treated with tunicamycin (2 
Hg/ ml) for 48 hours. A significant decrease in labeled Shh binding after treatment with 
tunicamycin was detected only in the patched-expressing oocytes. Each condition represents 
the mean bound label (cpm) from five oocytes ± standard deviation. 

Figure 22 illustrates a proposed topological model of the mouse patched protein. The 
mouse patched has been proposed to have 12 transmembrane domains and two glycosylated 
extracellular hydrophilic loops. Black and lightly shaded circles indicate identical and 
similar amino acids, respectively, shared between the mouse and fly patched proteins. 
(Goodrich et al. (1 996), supra at 3 1 0 ) 

All of the above-cited references and publications are hereby incorporated by 
reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, numerous equivalents to the specific polypeptides, nucleic acids, 
methods, assays and reagents described herein. Such equivalents are considered to be within 
the scope of this invention and are covered by the following claims. 



